1 CAXTTCCCAA AGACAAAato aattttcaao tccaaatttt caocttecta 

51 cta»tc»q*si cgtcflqtcat aatntccww qgacaaattg ttctcaccca 

101 gtctccagca atcatgtctg catctccagg ggagaaggtc accatgacct 

151 gcagtgccag ctcaagtgta agttacatga actggtacca gcagaagtca 

201 ggcacctccc ccaaaagatg gatttatgae acatccaaac tggcttctgg 

251 agtccctgct cacttcaggg gcagtgggtc tgggacctct tact etc tea 

301 caatcagegg catggaggct gaagatgctg ccacttatfca ctgocagcag 

351 tggagtagta acccattcac gtteggcteg gggacaaagt tggaaataaa 

401 cegggctgat aetgeaccaa ctgt*tccat ctteccaeca tccagtgagc 

451 agttaacatc tggaggtgce tcagtcgrgr gctccttgaa caacttctac 

501 cccaasgaca tcaatgteaa gtggaagatt gatggcagtg aacgacaaaa 

551 t ggcgt.ee tg aacagttgga ctgatcagga cagcaaagac agcacctaca 

601 gcatgagcag caccctcacg ttgaccaagg aegagtatga acgacataac 

651 agctatacct gtgaggecac teacaagaca tcaacttcae ceattgtcaa 

701 gagcttcaac aggaatgagt gtTACACACA AACCTCCTCA CACCCCACCA 

751 OCAGCTCCCA CCTCCATCCT ATCTTCCCTT CTAAGCTCTT GGACCCTTCC 

801 CCACAACCGC tTACCACTGT TCCCCTCCTC tAAACCTCCT CCCACCTCCT 

851 TCTCCTCCTC CTCCCTTTCC TTSGCTTTTA TCATGCTAAT ATTTCCAGAA 

901 AATATTCAAT AAACTGACTC TTTGCCTTGA AAAAAAAAAA AAA 

Fig. 1(a) 



1 MPFWPTrsr lATSASVTTS RfiOTVLTOSP A1MSASPCEX VTMTCSASSS 

51 VSYMNWYQQK SCTSPKWIT DTSKLASCVP AHFJtGSGSCT SXSVTXSGHE 

101 AXCAATYYCQ QWSSHPFTFC SGTKLEIKRA DTAPTVSIFP PSSEQLTSGG 

151 ASWCFLHNF YFKOINVKWE IDGSERQHCV 1H5CTDQDSX DSTVSKS5TX. 

201 TLTKOEYERH NSYTCEATHX TSTSPIVKSr KRHEC* 

Fig. 1(b) 
Field of the Invention 

The present invention relates to humanised antibody molecules, to processes for their production using 
recombinant DNA technology, and to their therapeutic uses. 

The term "humanised antibody molecule" is used to describe a molecule having an antigen binding site
derived from an immunoglobulin from a non-human species, the remaining immunoglobulin-derived parts of
the molecule being derived from a human immunoglobulin. The antigen binding site typically comprises
complementarity determining regions (CDRs) which determine the binding specificity of the antibody
molecule and which are carried on appropriate framework regions in the variable domains. There are 3
CDRs (CDR1, CDR2 and CDR3) in each of the heavy and light chain variable domains.

In the description, reference is made to a number of publications by number. The publications are listed 
in numerical order at the end of the description. 

Background of the Invention 


Natural immunoglobulins have been known for many years, as have the various fragments thereof, such
as the Fab, (Fab')2 and Fc fragments, which can be derived by enzymatic cleavage. Natural
immunoglobulins comprise a generally Y-shaped molecule having an antigen-binding site towards the end of
each upper arm. The remainder of the structure, and particularly the stem of the Y, mediates the effector
functions associated with immunoglobulins.

Natural immunoglobulins have been used in assay, diagnosis and, to a more limited extent, therapy.
However, such uses, especially in therapy, were hindered until recently by the polyclonal nature of natural
immunoglobulins. A significant step towards the realisation of the potential of immunoglobulins as
therapeutic agents was the discovery of procedures for the production of monoclonal antibodies (MAbs) of
defined specificity (1).

However, most MAbs are produced by hybridomas which are fusions of rodent spleen cells with rodent
myeloma cells. They are therefore essentially rodent proteins. There are very few reports of the production 
of human MAbs. 

Since most available MAbs are of rodent origin, they are naturally antigenic in humans and thus can
give rise to an undesirable immune response termed the HAMA (Human Anti-Mouse Antibody) response.
Therefore, the use of rodent MAbs as therapeutic agents in humans is inherently limited by the fact that the
human subject will mount an immunological response to the MAb and will either remove it entirely or at
least reduce its effectiveness. In practice, MAbs of rodent origin may not be used in patients for more than
one or a few treatments, as a HAMA response soon develops, rendering the MAb ineffective as well as
giving rise to undesirable reactions. For instance, OKT3, a mouse IgG2a/k MAb which recognises an antigen
in the T-cell receptor-CD3 complex, has been approved for use in many countries throughout the world as
an immunosuppressant in the treatment of acute allograft rejection [Chatenoud et al (2) and Jeffers et al (3)].
However, in view of the rodent nature of this and other such MAbs, a significant HAMA response, which
may include a major anti-idiotype component, may build up on use. Clearly, it would be highly desirable to
diminish or abolish this undesirable HAMA response and thus enlarge the areas of use of these very useful
antibodies.

Proposals have therefore been made to render non-human MAbs less antigenic in humans. Such
techniques can be generically termed "humanisation" techniques. These techniques typically involve the
use of recombinant DNA technology to manipulate DNA sequences encoding the polypeptide chains of the
antibody molecule.

Early methods for humanising MAbs involved production of chimeric antibodies in which an antigen
binding site comprising the complete variable domains of one antibody is linked to constant domains
derived from another antibody. Methods for carrying out such chimerisation procedures are described in
EP0120694 (Celltech Limited), EP0125023 (Genentech Inc. and City of Hope), EP-A-0 171 496 (Res. Dev.
Corp. Japan), EP-A-0 173 494 (Stanford University), and WO 86/01533 (Celltech Limited). This latter
Celltech application (WO 86/01533) discloses a process for preparing an antibody molecule having the
variable domains from a mouse MAb and the constant domains from a human immunoglobulin. Such
humanised chimeric antibodies, however, still contain a significant proportion of non-human amino acid
sequence, i.e. the complete non-human variable domains, and thus may still elicit some HAMA response,
particularly if administered over a prolonged period [Begent et al (ref. 4)].

In an alternative approach, described in EP-A-0239400 (Winter), the complementarity determining
regions (CDRs) of a mouse MAb have been grafted onto the framework regions of the variable domains of a
human immunoglobulin by site directed mutagenesis using long oligonucleotides. The present invention
relates to humanised antibody molecules prepared according to this alternative approach, i.e. CDR-grafted
humanised antibody molecules. Such CDR-grafted humanised antibodies are much less likely to give rise to
a HAMA response than humanised chimeric antibodies in view of the much lower proportion of non-human
amino acid sequence which they contain.

The earliest work on humanising MAbs by CDR-grafting was carried out on MAbs recognising synthetic
antigens, such as the NP or NIP antigens. However, examples in which a mouse MAb recognising lysozyme
and a rat MAb recognising an antigen on human T-cells were humanised by CDR-grafting have been
described by Verhoeyen et al (5) and Riechmann et al (6) respectively. The preparation of CDR-grafted
antibody to the antigen on human T cells is also described in WO 89/07452 (Medical Research Council).

In Riechmann et al/Medical Research Council it was found that transfer of the CDR regions alone [as
defined by Kabat refs. (7) and (8)] was not sufficient to provide satisfactory antigen binding activity in the
CDR-grafted product. Riechmann et al found that it was necessary to convert a serine residue at position 27
of the human sequence to the corresponding rat phenylalanine residue to obtain a CDR-grafted product
having improved antigen binding activity. This residue at position 27 of the heavy chain is within the
structural loop adjacent to CDR1. A further construct which additionally contained a human serine to rat
tyrosine change at position 30 of the heavy chain did not have a significantly altered binding activity over
the humanised antibody with the serine to phenylalanine change at position 27 alone. These results indicate
that changes to residues of the human sequence outside the CDR regions, in particular in the structural
loop adjacent to CDR1, may be necessary to obtain effective antigen binding activity for CDR-grafted
antibodies which recognise more complex antigens. Even so the binding affinity of the best CDR-grafted
antibodies obtained was still significantly less than the original MAb.

Very recently Queen et al (9) have described the preparation of a humanised antibody that binds to the
interleukin 2 receptor, by combining the CDRs of a murine MAb (anti-Tac) with human immunoglobulin
framework and constant regions. The human framework regions were chosen to maximise homology with
the anti-Tac MAb sequence. In addition computer modelling was used to identify framework amino acid
residues which were likely to interact with the CDRs or antigen, and mouse amino acids were used at these
positions in the humanised antibody.

In WO 90/07861 Queen et al propose four criteria for designing humanised immunoglobulins. The first
criterion is to use as the human acceptor the framework from a particular human immunoglobulin that is
unusually homologous to the non-human donor immunoglobulin to be humanised, or to use a consensus
framework from many human antibodies. The second criterion is to use the donor amino acid rather than
the acceptor if the human acceptor residue is unusual and the donor residue is typical for human
sequences at a specific residue of the framework. The third criterion is to use the donor framework amino
acid residue rather than the acceptor at positions immediately adjacent to the CDRs. The fourth criterion is
to use the donor amino acid residue at framework positions at which the amino acid is predicted to have a
side chain atom within about 3 Å of the CDRs in a three-dimensional immunoglobulin model and to be
capable of interacting with the antigen or with the CDRs of the humanised immunoglobulin. It is proposed
that criteria two, three or four may be applied in addition or alternatively to criterion one, and may be
applied singly or in any combination.

WO 90/07861 describes in detail the preparation of a single CDR-grafted humanised antibody, a
humanised antibody having specificity for the p55 Tac protein of the IL-2 receptor. The combination of all
four criteria, as above, was employed in designing this humanised antibody, the variable region
frameworks of the human antibody Eu (7) being used as acceptor. In the resultant humanised antibody the donor
CDRs were as defined by Kabat et al (7 and 8) and in addition the mouse donor residues were used in
place of the human acceptor residues, at positions 27, 30, 48, 66, 67, 89, 91, 94, 103, 104, 105 and 107 in
the heavy chain and at positions 48, 60 and 63 in the light chain, of the variable region frameworks. The
humanised anti-Tac antibody obtained is reported to have an affinity for p55 of 3 × 10⁹ M⁻¹, about one-third
of that of the murine MAb.

We have further investigated the preparation of CDR-grafted humanised antibody molecules and have
identified a hierarchy of positions within the framework of the variable regions (i.e. outside both the Kabat
CDRs and structural loops of the variable regions) at which the amino acid identities of the residues are
important for obtaining CDR-grafted products with satisfactory binding affinity. This has enabled us to
establish a protocol for obtaining satisfactory CDR-grafted products which may be applied very widely
irrespective of the level of homology between the donor immunoglobulin and acceptor framework. The set
of residues which we have identified as being of critical importance does not coincide with the residues
identified by Queen et al (9).
Summary of the Invention 

Accordingly, in a first aspect the invention provides a CDR-grafted antibody heavy chain having a
variable region domain comprising acceptor framework and donor antigen binding regions wherein the
framework comprises donor residues at at least one of positions 6, 23 and/or 24, 48 and/or 49, 71 and/or
73, 75 and/or 76 and/or 78 and 88 and/or 91.

In preferred embodiments, the heavy chain framework comprises donor residues at positions 23, 24, 49,
71, 73 and 78 or at positions 23, 24 and 49. The residues at positions 71, 73 and 78 of the heavy chain
framework are preferably either all acceptor or all donor residues.

In particularly preferred embodiments the heavy chain framework additionally comprises donor residues
at one, some or all of positions 6, 37, 48 and 94. Also it is particularly preferred that residues at positions of
the heavy chain framework which are commonly conserved across species, i.e. positions 2, 4, 25, 36, 39,
47, 93, 103, 104, 106 and 107, if not conserved between donor and acceptor, additionally comprise donor
residues. Most preferably the heavy chain framework additionally comprises donor residues at positions 2,
4, 6, 25, 36, 37, 39, 47, 48, 93, 94, 103, 104, 106 and 107.

In addition the heavy chain framework optionally comprises donor residues at one, some or all of
positions:
1 and 3,
72 and 76,
69 (if 48 is different between donor and acceptor),
38 and 46 (if 46 is the donor residue),
80 and 20 (if 69 is the donor residue),
67,
82 and 18 (if 67 is the donor residue),
91,
88, and
any one or more of 9, 11, 41, 87, 108, 110 and 112.

In the first and other aspects of the present invention reference is made to CDR-grafted antibody
products comprising acceptor framework and donor antigen binding regions. It will be appreciated that the
invention is widely applicable to the CDR-grafting of antibodies in general. Thus, the donor and acceptor
antibodies may be derived from animals of the same species and even same antibody class or sub-class.
More usually, however, the donor and acceptor antibodies are derived from animals of different species.
Typically the donor antibody is a non-human antibody, such as a rodent MAb, and the acceptor antibody is
a human antibody.

In the first and other aspects of the present invention, the donor antigen binding region typically
comprises at least one CDR from the donor antibody. Usually the donor antigen binding region comprises
at least two and preferably all three CDRs of each of the heavy chain and/or light chain variable regions.
The CDRs may comprise the Kabat CDRs, the structural loop CDRs or a composite of the Kabat and
structural loop CDRs and any combination of any of these. Preferably, the antigen binding regions of the
CDR-grafted heavy chain variable domain comprise CDRs corresponding to the Kabat CDRs at CDR2
(residues 50-65) and CDR3 (residues 95-100) and a composite of the Kabat and structural loop CDRs at
CDR1 (residues 26-35).

The residue designations given above and elsewhere in the present application are numbered
according to the Kabat numbering [refs. (7) and (8)]. Thus the residue designations do not always correspond
directly with the linear numbering of the amino acid residues. The actual linear amino acid sequence may
contain fewer or additional amino acids than in the strict Kabat numbering, corresponding to a shortening of,
or insertion into, a structural component, whether framework or CDR, of the basic variable domain structure.
For example, the heavy chain variable region of the anti-Tac antibody described by Queen et al (9) contains
a single amino acid insert (residue 52a) after residue 52 of CDR2 and a three amino acid insert (residues
82a, 82b and 82c) after framework residue 82, in the Kabat numbering. The correct Kabat numbering of
residues may be determined for a given antibody by alignment at regions of homology of the sequence of
the antibody with a "standard" Kabat numbered sequence.
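
By way of illustration only, the relationship between Kabat residue designations and linear numbering can be sketched in Python as follows. The helper names `kabat_key` and `linear_index` and the toy list of labels are hypothetical and are not taken from the cited references; the sketch assumes that the residues have already been assigned Kabat labels by alignment, and merely shows how an insert such as residue 52a shifts the linear position of every later residue.

```python
import re

def kabat_key(label):
    """Split a Kabat label such as '52a' into (52, 'a') so labels sort in chain order."""
    match = re.fullmatch(r"(\d+)([a-z]?)", label.lower())
    if match is None:
        raise ValueError(f"not a Kabat position label: {label!r}")
    return int(match.group(1)), match.group(2)

def linear_index(labels):
    """Map each Kabat label to its 1-based linear position among the given labels."""
    ordered = sorted(labels, key=kabat_key)
    return {label: i for i, label in enumerate(ordered, start=1)}

# Hypothetical fragment: Kabat residues 50-54 with a single insert after residue 52.
labels = ["50", "51", "52", "52a", "53", "54"]
index = linear_index(labels)
print(index["52a"], index["53"])  # residue 53 is the fifth residue of this fragment
```

In this toy fragment the residue designated 53 occupies the fifth linear position, which is why positions in the present description are always quoted in Kabat numbering and not by linear count.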

The invention also provides in a second aspect a CDR-grafted antibody light chain having a variable
region domain comprising acceptor framework and donor antigen binding regions wherein the framework
comprises donor residues at at least one of positions 1 and/or 3 and 46 and/or 47. Preferably the CDR
grafted light chain of the second aspect comprises donor residues at positions 46 and/or 47.

The invention also provides in a third aspect a CDR-grafted antibody light chain having a variable region
domain comprising acceptor framework and donor antigen binding regions wherein the framework
comprises donor residues at at least one of positions 46, 48, 58 and 71.

In a preferred embodiment of the third aspect, the framework comprises donor residues at all of
positions 46, 48, 58 and 71.

In particularly preferred embodiments of the second and third aspects, the framework additionally
comprises donor residues at positions 36, 44, 47, 85 and 87. Similarly positions of the light chain framework
which are commonly conserved across species, i.e. positions 2, 4, 6, 35, 49, 62, 64-69, 98, 99, 101 and
102, if not conserved between donor and acceptor, additionally comprise donor residues. Most preferably
the light chain framework additionally comprises donor residues at positions 2, 4, 6, 35, 36, 38, 44, 47, 49,
62, 64-69, 85, 87, 98, 99, 101 and 102.
In addition the framework of the second or third aspects optionally comprises donor residues at one,
some or all of positions:
1 and 3,
63,
60 (if 60 and 54 are able to form a potential saltbridge),
70 (if 70 and 24 are able to form a potential saltbridge),
73 and 21 (if 47 is different between donor and acceptor),
37 and 45 (if 47 is different between donor and acceptor),
and
any one or more of 10, 12, 40, 80, 103 and 105.
Preferably, the antigen binding regions of the CDR-grafted light chain variable domain comprise CDRs
corresponding to the Kabat CDRs at CDR1 (residues 24-34), CDR2 (residues 50-56) and CDR3 (residues
89-97).

The invention further provides in a fourth aspect a CDR-grafted antibody molecule comprising at least
one CDR-grafted heavy chain and at least one CDR-grafted light chain according to the first and second or
first and third aspects of the invention.

The humanised antibody molecules and chains of the present invention may comprise: a complete
antibody molecule, having full length heavy and light chains; a fragment thereof, such as a Fab, (Fab')2 or
Fv fragment; a light chain or heavy chain monomer or dimer; or a single chain antibody, e.g. a single chain
Fv in which heavy and light chain variable regions are joined by a peptide linker; or any other CDR-grafted
molecule with the same specificity as the original donor antibody. Similarly the CDR-grafted heavy and light
chain variable region may be combined with other antibody domains as appropriate.

Also the heavy or light chains or humanised antibody molecules of the present invention may have
attached to them an effector or reporter molecule. For instance, it may have a macrocycle, for chelating a
heavy metal atom, or a toxin, such as ricin, attached to it by a covalent bridging structure. Alternatively, the
procedures of recombinant DNA technology may be used to produce an immunoglobulin molecule in which
the Fc fragment or CH3 domain of a complete immunoglobulin molecule has been replaced by, or has
attached thereto by peptide linkage, a functional non-immunoglobulin protein, such as an enzyme or toxin
molecule.

Any appropriate acceptor variable region framework sequences may be used having regard to
class/type of the donor antibody from which the antigen binding regions are derived. Preferably, the type of
acceptor framework used is of the same/similar class/type as the donor antibody. Conveniently, the
framework may be chosen to maximise/optimise homology with the donor antibody sequence particularly at
positions close or adjacent to the CDRs. However, a high level of homology between donor and acceptor
sequences is not important for application of the present invention. The present invention identifies a
hierarchy of framework residue positions at which donor residues may be important or desirable for
obtaining a CDR-grafted antibody product having satisfactory binding properties. The CDR-grafted products
usually have binding affinities of at least 10⁵ M⁻¹, preferably at least about 10⁸ M⁻¹, or especially in the
range 10⁸-10¹² M⁻¹. In principle, the present invention is applicable to any combination of donor and
acceptor antibodies irrespective of the level of homology between their sequences. A protocol for applying
the invention to any particular donor-acceptor antibody pair is given hereinafter. Examples of human
frameworks which may be used are KOL, NEWM, REI, EU, LAY and POM (refs. 4 and 5) and the like; for
instance KOL and NEWM for the heavy chain and REI for the light chain and EU, LAY and POM for both
the heavy chain and the light chain.

Also the constant region domains of the products of the invention may be selected having regard to the
proposed function of the antibody, in particular the effector functions which may be required. For example,
the constant region domains may be human IgA, IgE, IgG or IgM domains. In particular, IgG human
constant region domains may be used, especially of the IgG1 and IgG3 isotypes, when the humanised
antibody molecule is intended for therapeutic uses, and antibody effector functions are required.
Alternatively, IgG2 and IgG4 isotypes may be used when the humanised antibody molecule is intended for
therapeutic purposes and antibody effector functions are not required, e.g. for simple blocking of
lymphokine activity.

However, the remainder of the antibody molecules need not comprise only protein sequences from
immunoglobulins. For instance, a gene may be constructed in which a DNA sequence encoding part of a
human immunoglobulin chain is fused to a DNA sequence encoding the amino acid sequence of a
functional polypeptide such as an effector or reporter molecule.

Preferably the CDR-grafted antibody heavy and light chain and antibody molecule products are
produced by recombinant DNA technology.

Thus in further aspects the invention also includes DNA sequences coding for the CDR-grafted heavy
and light chains, cloning and expression vectors containing the DNA sequences, host cells transformed with
the DNA sequences and processes for producing the CDR-grafted chains and antibody molecules
comprising expressing the DNA sequences in the transformed host cells.

The general methods by which the vectors may be constructed, transfection methods and culture
methods are well known per se and form no part of the invention. Such methods are shown, for instance, in
references 10 and 11.

The DNA sequences which encode the donor amino acid sequence may be obtained by methods well
known in the art. For example the donor coding sequences may be obtained by genomic cloning, or cDNA
cloning from suitable hybridoma cell lines. Positive clones may be screened using appropriate probes for
the heavy and light chain genes in question. Also PCR cloning may be used.

DNA coding for acceptor, e.g. human acceptor, sequences may be obtained in any appropriate way.
For example DNA sequences coding for preferred human acceptor frameworks such as KOL, REI, EU and
NEWM, are widely available to workers in the art.

The standard techniques of molecular biology may be used to prepare DNA sequences coding for the
CDR-grafted products. Desired DNA sequences may be synthesised completely or in part using
oligonucleotide synthesis techniques. Site-directed mutagenesis and polymerase chain reaction (PCR)
techniques may be used as appropriate. For example oligonucleotide directed synthesis as described by
Jones et al (ref. 20) may be used. Also oligonucleotide directed mutagenesis of a pre-existing variable
region as, for example, described by Verhoeyen et al (ref. 5) or Riechmann et al (ref. 6) may be used. Also
enzymatic filling in of gapped oligonucleotides using T4 DNA polymerase as, for example, described by
Queen et al (ref. 9) may be used.

Any suitable host cell/vector system may be used for expression of the DNA sequences coding for the
CDR-grafted heavy and light chains. Bacterial, e.g. E. coli, and other microbial systems may be used, in
particular for expression of antibody fragments such as Fab and (Fab')2 fragments, and especially Fv
fragments and single chain antibody fragments, e.g. single chain Fvs. Eucaryotic, e.g. mammalian, host cell
expression systems may be used for production of larger CDR-grafted antibody products, including
complete antibody molecules. Suitable mammalian host cells include CHO cells and myeloma or hybridoma
cell lines.

Thus, in a further aspect the present invention provides a process for producing a CDR-grafted antibody
product comprising:

(a) producing in an expression vector an operon having a DNA sequence which encodes an antibody
heavy chain according to the first aspect of the invention;

and/or

(b) producing in an expression vector an operon having a DNA sequence which encodes a complementary
antibody light chain according to the second or third aspect of the invention;

(c) transfecting a host cell with the or each vector; and

(d) culturing the transfected cell line to produce the CDR-grafted antibody product.

The CDR-grafted product may comprise only heavy or light chain derived polypeptide, in which case
only a heavy chain or light chain polypeptide coding sequence is used to transfect the host cells.

For production of products comprising both heavy and light chains, the cell line may be transfected with two
vectors, the first vector containing an operon encoding a light chain-derived polypeptide and the second
vector containing an operon encoding a heavy chain-derived polypeptide. Preferably, the vectors are
identical, except in so far as the coding sequences and selectable markers are concerned, so as to ensure
as far as possible that each polypeptide chain is equally expressed. Alternatively, a single vector may be
used, the vector including the sequences encoding both light chain- and heavy chain-derived polypeptides.

The DNA in the coding sequences for the light and heavy chains may comprise cDNA or genomic DNA 
or both. However, it is preferred that the DNA sequence encoding the heavy or light chain comprises, at
least partially, genomic DNA, preferably a fusion of cDNA and genomic DNA.
The present invention is applicable to antibodies of any appropriate specificity. Advantageously,
however, the invention may be applied to the humanisation of non-human antibodies which are used for in
vivo therapy or diagnosis. Thus the antibodies may be site-specific antibodies such as tumour-specific or
cell surface-specific antibodies, suitable for use in in vivo therapy or diagnosis, e.g. tumour imaging.
Examples of cell surface-specific antibodies are anti-T cell antibodies, such as anti-CD3, and CD4 and
adhesion molecules, such as CR3, ICAM and ELAM. The antibodies may have specificity for interleukins
(including lymphokines, growth factors and stimulating factors), hormones and other biologically active
compounds, and receptors for any of these. For example, the antibodies may have specificity for any of the
following: interferons α, β, γ or δ, IL1, IL2, IL3, or IL4, etc., TNF, GCSF, GMCSF, EPO, hGH, or insulin, etc.

The present invention also includes therapeutic and diagnostic compositions comprising the
CDR-grafted products of the invention and uses of such compositions in therapy and diagnosis.

Accordingly in a further aspect the invention provides a therapeutic or diagnostic composition
comprising a CDR-grafted antibody heavy or light chain or molecule according to previous aspects of the invention
in combination with a pharmaceutically acceptable carrier, diluent or excipient.

Accordingly also the invention provides a method of therapy or diagnosis comprising administering an 
effective amount of a CDR-grafted antibody heavy or light chain or molecule according to previous aspects 
of the invention to a human or animal subject. 

A preferred protocol for obtaining CDR-grafted antibody heavy and light chains in accordance with the 
present invention is set out below together with the rationale by which we have derived this protocol. This 
protocol and rationale are given without prejudice to the generality of the invention as hereinbefore 
described and defined.

Protocol 

It is first of all necessary to sequence the DNA coding for the heavy and light chain variable regions of
the donor antibody, to determine their amino acid sequences. It is also necessary to choose appropriate
acceptor heavy and light chain variable regions, of known amino acid sequence. The CDR-grafted chain is
then designed starting from the basis of the acceptor sequence. It will be appreciated that in some cases
the donor and acceptor amino acid residues may be identical at a particular position and thus no change of
acceptor framework residue is required.

1. As a first step donor residues are substituted for acceptor residues in the CDRs. For this purpose the
CDRs are preferably defined as follows:

Heavy chain - CDR1: residues 26-35
            - CDR2: residues 50-65
            - CDR3: residues 95-102
Light chain - CDR1: residues 24-34
            - CDR2: residues 50-56
            - CDR3: residues 89-97

The positions at which donor residues are to be substituted for acceptor in the framework are then 
chosen as follows, first of all with respect to the heavy chain and subsequently with respect to the light 
chain. 

2. Heavy Chain

2.1 Choose donor residues at all of positions 23, 24, 49, 71, 73 and 78 of the heavy chain or all of
positions 23, 24 and 49 (71, 73 and 78 are always either all donor or all acceptor).

2.2 Check that the following have the same amino acid in donor and acceptor sequences, and if not
preferably choose the donor: 2, 4, 6, 25, 36, 37, 39, 47, 48, 93, 94, 103, 104, 106 and 107.

2.3 To further optimise affinity consider choosing donor residues at one, some or any of:

i. 1, 3

ii. 72, 76

iii. If 48 is different between donor and acceptor sequences, consider 69

iv. If at 46 the donor residue is chosen, consider 38 and 46

v. If at 69 the donor residue is chosen, consider 80 and then 20

vi. 67

vii. If at 67 the donor residue is chosen, consider 82 and then 18

viii. 91

ix. 88

x. 9, 11, 41, 87, 108, 110, 112

3. Light Chain 
3.1 Choose donor at 46, 48, 58 and 71

3.2 Check that the following have the same amino acid in donor and acceptor sequences, if not
preferably choose donor:

2, 4, 6, 35, 38, 44, 47, 49, 62, 64-69 inclusive, 85, 87, 98, 99, 101 and 102

3.3 To further optimise affinity consider choosing donor residues at one, some or any of:

i. 1, 3

ii. 63

iii. 60, if 60 and 54 are able to form potential saltbridge

iv. 70, if 70 and 24 are able to form potential saltbridge

v. 73, and 21 if 47 is different between donor and acceptor

vi. 37, and 45 if 47 is different between donor and acceptor

vii. 10, 12, 40, 80, 103, 105
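
Purely as an illustrative sketch, and without limiting the protocol, steps 1, 2.1, 2.2, 3.1 and 3.2 above may be expressed in Python as follows. The names `RULES`, `in_cdr` and `design_graft` are hypothetical, as are the toy residue dictionaries. The sketch assumes that donor and acceptor sequences are already indexed by integer Kabat position (insert residues such as 52a are ignored), that the fuller heavy chain set of step 2.1 is chosen, and that the optional refinements of steps 2.3 and 3.3 are left to the skilled worker.

```python
# Kabat positions copied from steps 1, 2.1, 2.2, 3.1 and 3.2 of the protocol text.
RULES = {
    "heavy": {
        "cdrs": [(26, 35), (50, 65), (95, 102)],
        "key": {23, 24, 49, 71, 73, 78},
        "check": {2, 4, 6, 25, 36, 37, 39, 47, 48, 93, 94, 103, 104, 106, 107},
    },
    "light": {
        "cdrs": [(24, 34), (50, 56), (89, 97)],
        "key": {46, 48, 58, 71},
        "check": {2, 4, 6, 35, 38, 44, 47, 49, 62, 64, 65, 66, 67, 68, 69,
                  85, 87, 98, 99, 101, 102},
    },
}

def in_cdr(pos, cdrs):
    """True if a Kabat position lies inside any inclusive CDR range."""
    return any(first <= pos <= last for first, last in cdrs)

def design_graft(chain, donor, acceptor):
    """Start from the acceptor and substitute donor residues in the CDRs and at
    the listed framework positions. Returns (grafted residues, changed framework positions)."""
    rules = RULES[chain]
    framework_donor = rules["key"] | rules["check"]
    grafted, changed = {}, []
    for pos in sorted(acceptor):
        if in_cdr(pos, rules["cdrs"]):
            grafted[pos] = donor[pos]          # step 1: CDR residues come from the donor
        elif pos in framework_donor and donor[pos] != acceptor[pos]:
            grafted[pos] = donor[pos]          # steps x.1 and x.2: donor at listed positions
            changed.append(pos)
        else:
            grafted[pos] = acceptor[pos]       # all other framework stays acceptor
    return grafted, changed

# Hypothetical five-residue heavy chain fragment (Kabat positions 22-26).
acceptor = {22: "C", 23: "K", 24: "A", 25: "S", 26: "G"}
donor = {22: "S", 23: "T", 24: "A", 25: "S", 26: "Y"}
grafted, changed = design_graft("heavy", donor, acceptor)
print(grafted, changed)
```

In the toy fragment only position 23 is reported as a framework change: position 22 is not in any list and stays acceptor, positions 24 and 25 are identical in donor and acceptor, and position 26 falls within CDR1 and is therefore taken from the donor under step 1.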

Rationale 


In order to transfer the binding site of an antibody into a different acceptor framework, a number of 
factors need to be considered. 

1. The extent of the CDRs 

The CDRs (Complementarity Determining Regions) were defined by Wu and Kabat (refs. 4 and 5) on the
basis of an analysis of the variability of different regions of antibody variable regions. Three regions per
domain were recognised. In the light chain the sequences are 24-34, 50-56, 89-97 (numbering according
to Kabat (ref. 4), Eu Index) inclusive and in the heavy chain the sequences are 31-35, 50-65 and 95-102
inclusive.

When antibody structures became available it became apparent that these CDR regions
corresponded in the main to loop regions which extended from the β barrel framework of the light and
heavy variable domains. For H1 there was a discrepancy in that the loop was from 26 to 32 inclusive and
for H2 the loop was 52 to 56 and for L2 from 50 to 53. However, with the exception of H1 the CDR
regions encompassed the loop regions and extended into the β strand frameworks. In H1 residue 26
tends to be a serine and 27 a phenylalanine or tyrosine, residue 29 is a phenylalanine in most cases.
Residues 28 and 30 which are surface residues exposed to solvent might be involved in antigen-binding.
A prudent definition of the H1 CDR therefore would include residues 26-35 to include both the loop
region and the hypervariable residues 33-35.
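The "prudent" H1 definition amounts to taking the union of the Kabat CDR and the structural loop. A minimal sketch of that bookkeeping follows, using only the residue ranges quoted in the text. The names `KABAT_CDRS`, `STRUCTURAL_LOOPS` and `prudent_region` are hypothetical, and the sketch assumes plain integer positions, so Kabat insertion letters such as 100B are not handled.

```python
# Illustrative sketch; ranges are those quoted in the text, names are assumptions.
KABAT_CDRS = {
    "L1": (24, 34), "L2": (50, 56), "L3": (89, 97),
    "H1": (31, 35), "H2": (50, 65), "H3": (95, 102),
}

# Structural loops for which the text reports a discrepancy with the Kabat CDRs.
STRUCTURAL_LOOPS = {"H1": (26, 32), "H2": (52, 56), "L2": (50, 53)}


def prudent_region(name):
    """Smallest inclusive range covering both the Kabat CDR and the loop, if one is listed."""
    start, end = KABAT_CDRS[name]
    if name in STRUCTURAL_LOOPS:
        loop_start, loop_end = STRUCTURAL_LOOPS[name]
        start, end = min(start, loop_start), max(end, loop_end)
    return start, end


if __name__ == "__main__":
    for region in KABAT_CDRS:
        print(region, prudent_region(region))
```

Only H1 changes under this union (26-35); for H2 and L2 the Kabat CDR already encloses the loop, which matches the statement that the CDR regions encompass the loops except in H1.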

It is of interest to note the example of Riechmann et al (ref. 3), who used the residue 31-35 choice
for CDR-H1. In order to produce efficient antigen binding, residue 27 also needed to be recruited from
the donor (rat) antibody.

2. Non-CDR residues which contribute to antigen binding 

By examination of available X-ray structures we have identified a number of residues which may have an 
effect on net antigen binding and which can be demonstrated by experiment. These residues can be 
sub-divided into a number of groups. 
2.1 Surface residues near CDR [all numbering as in Kabat et al (ref. 7)].

2.1.1. Heavy Chain - Key residues are 23, 71 and 73. Other residues which may contribute to a 
lesser extent are 1, 3 and 76. Finally 25 is usually conserved but the murine residue should be 
used if there is a difference. 

2.1.2 Light Chain - Many residues close to the CDRs, e.g. 63, 65, 67 and 69 are conserved. If
conserved none of the surface residues in the light chain are likely to have a major effect.
However, if the murine residue at these positions is unusual, then it would be of benefit to analyse
the likely contribution more closely. Other residues which may also contribute to binding are 1 and
3, and also 60 and 70 if the residues at these positions and at 54 and 24 respectively are
potentially able to form a salt bridge i.e. 60 + 54; 70 + 24.

2.2 Packing residues near the CDRs.

2.2.1. Heavy Chain - Key residues are 24, 49 and 78. Other key residues would be 36 if not a
tryptophan, 94 if not an arginine, 104 and 106 if not glycines and 107 if not a threonine. Residues
which may make a further contribution to stable packing of the heavy chain and hence improved
affinity are 2, 4, 6, 38, 46, 67 and 69. 67 packs against the CDR residue 63 and this pair could be
either both mouse or both human. Finally, residues which contribute to packing in this region but
from a longer range are 18, 20, 80, 82 and 86. 82 packs against 67 and in turn 18 packs against
82. 80 packs against 69 and in turn 20 packs against 80. 86 forms an H bond network with 38 and
46. Many of the mouse-human differences appear minor e.g. Leu-Ile, but could have a minor
impact on correct packing which could translate into altered positioning of the CDRs.

2.2.2. Light Chain - Key residues are 48, 58 and 71. Other key residues would be 6 if not
glutamine, 35 if not tryptophan, 62 if not phenylalanine or tyrosine, 64, 66, 68, 99 and 101 if not
glycines and 102 if not a threonine. Residues which make a further contribution are 2, 4, 37, 45
and 47. Finally residues 73 and 21 and 19 may make long distance packing contributions of a
minor nature.

2.3. Residues at the variable domain interface between heavy and light chains - In both the light and
heavy chains most of the non-CDR interface residues are conserved. If a conserved residue is
replaced by a residue of different character, e.g. size or charge, it should be considered for retention
as the murine residue.

2.3.1. Heavy Chain - Residues which need to be considered are 37 if the residue is not a valine but
is of larger side chain volume or has a charge or polarity. Other residues are 39 if not a glutamine,
45 if not a leucine, 47 if not a tryptophan, 91 if not a phenylalanine or tyrosine, 93 if not an alanine
and 103 if not a tryptophan. Residue 89 is also at the interface but is not in a position where the
side chain could be of great impact.

2.3.2. Light Chain - Residues which need to be considered are 36, if not a tyrosine, 38 if not a
glutamine, 44 if not a proline, 46, 49 if not a tyrosine, residue 85, residue 87 if not a tyrosine and
98 if not a phenylalanine.

2.4. Variable-Constant region interface - The elbow angle between variable and constant regions may
be affected by alterations in packing of key residues in the variable region against the constant region
which may affect the position of VL and VH with respect to one another. Therefore it is worth noting
the residues likely to be in contact with the constant region. In the heavy chain the surface residues
potentially in contact with the variable region are conserved between mouse and human antibodies,
therefore the variable region contact residues may influence the V-C interaction. In the light chain the
amino acids found at a number of the constant region contact points vary, and the V and C regions are
not in such close proximity as the heavy chain. Therefore the influences of the light chain V-C
interface may be minor.

2.4.1. Heavy Chain - Contact residues are 7, 11, 41, 87, 108, 110, 112.

2.4.2. Light Chain - In the light chain potentially contacting residues are 10, 12, 40, 80, 83, 103 and
105.

The above analysis coupled with our considerable practical experimental experience in the
CDR-grafting of a number of different antibodies has led us to the protocol given above.

The present invention is now described, by way of example only, with reference to the accompanying 
Figures 1-13. 



Brief Description of the Figures 

Figure 1 shows DNA and amino acid sequences of the OKT3 light chain;

Figure 2 shows DNA and amino acid sequences of the OKT3 heavy chain;

Figure 3 shows the alignment of the OKT3 light variable region amino acid sequence with that of the
light variable region of the human antibody REI;

Figure 4 shows the alignment of the OKT3 heavy variable region amino acid sequence with that of
the heavy variable region of the human antibody KOL;

Figure 5 shows the heavy variable region amino acid sequences of OKT3, KOL and various
corresponding CDR grafts;

Figure 6 shows the light variable region amino acid sequences of OKT3, REI and various
corresponding CDR grafts;

Figure 7 shows a graph of binding assay results for various grafted OKT3 antibodies;

Figure 8 shows a graph of blocking assay results for various grafted OKT3 antibodies;

Figure 9 shows a similar graph of blocking assay results;

Figure 10 shows similar graphs for both binding assay and blocking assay results;

Figure 11 shows further similar graphs for both binding assay and blocking assay results;

Figure 12 shows a graph of competition assay results for a minimally grafted OKT3 antibody
compared with the OKT3 murine reference standard, and

Figure 13 shows a similar graph of competition assay results comparing a fully grafted OKT3
antibody with the murine reference standard.
DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION



EXAMPLE 1 

CDR-GRAFTING OF OKT3
MATERIAL AND METHODS 

1. INCOMING CELLS 

Hybridoma cells producing antibody OKT3 were provided by Ortho (seedlot 4882.1) and were grown up
in antibiotic free Dulbecco's Modified Eagles Medium (DMEM) supplemented with glutamine and 5%
foetal calf serum, and divided to provide both an overgrown supernatant for evaluation and cells for
extraction of RNA. The overgrown supernatant was shown to contain 250 µg/mL murine IgG2a/kappa
antibody. The supernatant was negative for murine lambda light chain and IgG1, IgG2b, IgG3, IgA and
IgM heavy chain. 20 mL of supernatant was assayed to confirm that the antibody present was OKT3.

2. MOLECULAR BIOLOGY PROCEDURES 

Basic molecular biology procedures were as described in Maniatis et al (ref 0) with, in some cases,
minor modifications. DNA sequencing was performed as described in Sanger et al (ref. 11) and the
Amersham International plc sequencing handbook. Site directed mutagenesis was as described in
Kramer et al (ref. 12) and the Anglian Biotechnology Ltd. handbook. COS cell expression and metabolic
labelling studies were as described in Whittle et al (ref. 13).

3. RESEARCH ASSAYS 

3.1. ASSEMBLY ASSAYS Assembly assays were performed on supernatants from transfected COS
cells to determine the amount of intact IgG present.

3.1.1. COS CELLS TRANSFECTED WITH MOUSE OKT3 GENES The assembly assay for intact
mouse IgG in COS cell supernatants was an ELISA with the following format:

96 well microtitre plates were coated with F(ab')2 goat anti-mouse IgG Fc. The plates were washed
in water and samples added for 1 hour at room temperature. The plates were washed and F(ab')2
goat anti-mouse IgG F(ab')2 (HRPO conjugated) was then added. Substrate was added to reveal
the reaction. UPC10, a mouse IgG2a myeloma, was used as a standard.

3.1.2. COS AND CHO CELLS TRANSFECTED WITH CHIMERIC OR CDR-GRAFTED OKT3 
GENES 

The assembly assay for chimeric or CDR-grafted antibody in COS cell supernatants was an ELISA
with the following format:

96 well microtitre plates were coated with F(ab')2 goat anti-human IgG Fc. The plates were
washed and samples added and incubated for 1 hour at room temperature. The plates were
washed and monoclonal mouse anti-human kappa chain was added for 1 hour at room temperature.

The plates were washed and F(ab')2 goat anti-mouse IgG Fc (HRPO conjugated) was added.
Enzyme substrate was added to reveal the reaction. Chimeric B72.3 (IgG4) (ref. 13) was used as a
standard. The use of a monoclonal anti-kappa chain in this assay allows grafted antibodies to be
read from the chimeric standard.

3.2. ASSAY FOR ANTIGEN BINDING ACTIVITY 

Material from COS cell supernatants was assayed for OKT3 antigen binding activity onto CD3 positive 
cells in a direct assay. The procedure was as follows: 

HUT 78 cells (human T cell line, CD3 positive) were maintained in culture. Monolayers of HUT 78
cells were prepared onto 96 well ELISA plates using poly-L-lysine and glutaraldehyde. Samples were
added to the monolayers for 1 hour at room temperature.

The plates were washed gently using PBS. F(ab')2 goat anti-human IgG Fc (HRPO conjugated) or
F(ab')2 goat anti-mouse IgG Fc (HRPO conjugated) was added as appropriate for humanised or mouse
samples. Substrate was added to reveal the reaction.

The negative control for the cell-based assay was chimeric B72.3. The positive control was mouse
Orthomune OKT3 or chimeric OKT3, when available. This cell-based assay was difficult to perform,
and an alternative assay was developed for CDR-grafted OKT3 which was more sensitive and easier
to carry out.

In this system CDR-grafted OKT3 produced by COS cells was tested for its ability to bind to the
CD3-positive HPB-ALL (human peripheral blood acute lymphocytic leukemia) cell line. It was also tested for
its ability to block the binding of murine OKT3 to these cells. Binding was measured by the following
procedure: HPB-ALL cells were harvested from tissue culture. Cells were incubated at 4°C for 1 hour
with various dilutions of test antibody, positive control antibody, or negative control antibody. The cells
were washed once and incubated at 4°C for 1 hour with an FITC-labelled goat anti-human IgG
(Fc-specific, mouse absorbed). The cells were washed twice and analysed by cytofluorography. Chimeric
OKT3 was used as a positive control for direct binding. Cells incubated with mock-transfected COS
cell supernatant, followed by the FITC-labelled goat anti-human IgG, provided the negative control.

To test the ability of CDR-grafted OKT3 to block murine OKT3 binding, the HPB-ALL cells were
incubated at 4°C for 1 hour with various dilutions of test antibody or control antibody. A fixed
saturating amount of FITC OKT3 was added. The samples were incubated for 1 hour at 4°C, washed
twice and analysed by cytofluorography. FITC-labelled OKT3 was used as a positive control to
determine maximum binding. Unlabelled murine OKT3 served as a reference standard for blocking.
Negative controls were unstained cells with or without mock-transfected cell supernatant.

The ability of the CDR-grafted OKT3 light chain to bind CD3-positive cells and block the binding of
murine OKT3 was initially tested in combination with the chimeric OKT3 heavy chain. The chimeric OKT3 heavy
chain is composed of the murine OKT3 variable region and the human IgG4 constant region. The
chimeric heavy chain gene is expressed in the same expression vector used for the CDR-grafted
genes. The CDR-grafted light chain expression vector and the chimeric heavy chain expression vector
were co-transfected into COS cells. The fully chimeric OKT3 antibody (chimeric light chain and
chimeric heavy chain) was found to be fully capable of binding to CD3 positive cells and blocking the
binding of murine OKT3 to these cells.
3.3 DETERMINATION OF RELATIVE BINDING AFFINITY 

The relative binding affinities of CDR-grafted anti-CD3 monoclonal antibodies were determined by
competition binding (ref. 6) using the HPB-ALL human T cell line as a source of CD3 antigen, and
fluorescein-conjugated murine OKT3 (Fl-OKT3) of known binding affinity as a tracer antibody. The
binding affinity of Fl-OKT3 tracer antibody was determined by a direct binding assay in which
increasing amounts of Fl-OKT3 were incubated with HPB-ALL (5 x 10^5) in PBS with 5% foetal calf
serum for 60 min. at 4°C. Cells were washed, and the fluorescence intensity was determined on a
FACScan flow cytometer calibrated with quantitative microbead standards (Flow Cytometry Standards,
Research Triangle Park, NC). Fluorescence intensity per antibody molecule (F/P ratio) was
determined by using microbeads which have a predetermined number of mouse IgG antibody binding sites
(Simply Cellular beads, Flow Cytometry Standards). F/P equals the fluorescence intensity of beads
saturated with Fl-OKT3 divided by the number of binding sites per bead. The amount of bound and
free Fl-OKT3 was calculated from the mean fluorescence intensity per cell, and the ratio of bound/free
was plotted against the number of moles of antibody bound. A linear fit was used to determine the
affinity of binding (absolute value of the slope).
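The direct binding analysis described above (F/P ratio, then a straight-line fit of bound/free against bound) can be illustrated with a short sketch. The function names `fp_ratio` and `scatchard_affinity` are hypothetical, the fit is an ordinary least-squares line, and the units of the inputs are left to the caller; none of this is taken from the original analysis software.

```python
# Illustrative sketch under stated assumptions; not the original analysis code.

def fp_ratio(saturated_bead_fluorescence, binding_sites_per_bead):
    """Fluorescence per antibody molecule: saturated bead intensity / sites per bead."""
    return saturated_bead_fluorescence / binding_sites_per_bead


def scatchard_affinity(bound, free):
    """Affinity as the absolute slope of a least-squares line of bound/free versus bound."""
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired bound/free measurements")
    ratios = [b / f for b, f in zip(bound, free)]
    n = len(bound)
    mean_x = sum(bound) / n
    mean_y = sum(ratios) / n
    sxx = sum((x - mean_x) ** 2 for x in bound)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(bound, ratios))
    return abs(sxy / sxx)


if __name__ == "__main__":
    # Synthetic data generated from an assumed affinity of 2.0 and capacity of 10.0.
    ka, bmax = 2.0, 10.0
    bound = [1.0, 2.0, 4.0, 8.0]
    free = [b / (ka * (bmax - b)) for b in bound]
    print(scatchard_affinity(bound, free))
```

With noise-free synthetic data the fit recovers the affinity used to generate the points; with real cytometry data the quality of the linear fit would need to be inspected before the slope is trusted.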

For competitive binding, increasing amounts of competitor antibody were added to a sub-saturating
dose of Fl-OKT3 and incubated with 5 x 10^5 HPB-ALL in 200 ml of PBS with 5% foetal calf serum, for
60 min at 4°C. The fluorescence intensities of the cells were measured on a FACScan flow cytometer
calibrated with quantitative microbead standards. The concentrations of bound and free Fl-OKT3 were
calculated. The affinities of competing antibodies were calculated from the equation [X] - [OKT3] =
(1/Kx) - (1/Ka), where Ka is the affinity of murine OKT3, Kx is the affinity of competitor X, [ ] is the
concentration of competitor antibody at which bound/free binding is R/2, and R is the maximal
bound/free binding.
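Reading the competition equation as [X] - [OKT3] = (1/Kx) - (1/Ka), the competitor affinity follows by rearrangement: 1/Kx = [X] - [OKT3] + 1/Ka. The sketch below applies that rearrangement; the function name `competitor_affinity` is hypothetical, and it assumes concentrations in mol/L and affinities in L/mol so that the terms are dimensionally compatible.

```python
# Illustrative sketch; assumes molar concentrations and affinities in reciprocal molar units.

def competitor_affinity(x_half, okt3_half, ka):
    """Solve [X] - [OKT3] = 1/Kx - 1/Ka for Kx.

    x_half    -- competitor concentration giving bound/free = R/2
    okt3_half -- unlabelled murine OKT3 concentration giving bound/free = R/2
    ka        -- affinity of murine OKT3
    """
    inverse_kx = (x_half - okt3_half) + 1.0 / ka
    if inverse_kx <= 0:
        raise ValueError("inputs imply a non-positive 1/Kx")
    return 1.0 / inverse_kx


if __name__ == "__main__":
    # Invented numbers purely to show the arithmetic.
    print(competitor_affinity(3e-9, 2e-9, 1e9))
```

A competitor that reaches half-maximal displacement at the same concentration as murine OKT3 returns Kx equal to Ka, and one that needs a higher concentration returns a lower affinity.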

4. cDNA LIBRARY CONSTRUCTION 

4.1. mRNA PREPARATION AND cDNA SYNTHESIS 

OKT3 producing cells were grown as described above and 1.2 x 10^9 cells harvested and mRNA
extracted using the guanidinium/LiCl extraction procedure. cDNA was prepared by priming from
oligo-dT to generate full length cDNA. The cDNA was methylated and EcoR1 linkers added for cloning.

4.2. LIBRARY CONSTRUCTION 

The cDNA library was ligated to pSP65 vector DNA which had been EcoR1 cut and the 5' phosphate
groups removed by calf intestinal phosphatase (EcoR1/CIP). The ligation was used to transform high
transformation efficiency Escherichia coli (E.coli) HB101. A cDNA library was prepared. 3600 colonies
were screened for the light chain and 10000 colonies were screened for the heavy chain.

5. SCREENING 

E.coli colonies positive for either heavy or light chain probes were identified by oligonucleotide screening
using the oligonucleotides: 5' TCCAGATGTTAACTGCTCAC for the light chain, which is complementary
to a sequence in the mouse kappa constant region, and 5' CAGGGGCCAGTGGATGGATAGAC for the
heavy chain which is complementary to a sequence in the mouse IgG2a constant CH1 domain region. 12
light chain and 9 heavy chain clones were identified and taken for second round screening. Positive
clones from the second round of screening were grown up and DNA prepared. The sizes of the gene
inserts were estimated by gel electrophoresis and inserts of a size capable of containing a full length
cDNA were subcloned into M13 for DNA sequencing.
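The statement that a screening probe is complementary to the target can be checked by taking the reverse complement of the probe and searching for it in the coding strand. The sketch below does this for the light chain probe; `KAPPA_FRAGMENT` is a short stretch transcribed from the Figure 1(a) listing (roughly positions 441-470) and is an assumption insofar as the listing may contain transcription errors. The helper names are hypothetical.

```python
# Illustrative sketch; the fragment is transcribed from Figure 1(a) and may carry transcription errors.

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def probe_hybridises(probe, target):
    """True if the probe is exactly complementary to some stretch of the target strand."""
    return reverse_complement(probe).upper() in target.upper()


LIGHT_CHAIN_PROBE = "TCCAGATGTTAACTGCTCAC"          # from the screening step
KAPPA_FRAGMENT = "TCCAGTGAGCAGTTAACATCTGGAGGTG"      # assumed excerpt of Figure 1(a)

if __name__ == "__main__":
    print(reverse_complement(LIGHT_CHAIN_PROBE))
    print(probe_hybridises(LIGHT_CHAIN_PROBE, KAPPA_FRAGMENT))
```

Exact string matching is a simplification; real hybridisation tolerates some mismatches depending on stringency, which this sketch does not model.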

6. DNA SEQUENCING 

Clones representing four size classes for both heavy and light chains were obtained in M13. DNA
sequence for the 5' untranslated regions, signal sequences, variable regions and 3' untranslated regions
of full length cDNAs [Figures 1(a) and 2(a)] were obtained and the corresponding amino acid sequences
predicted [Figures 1(b) and 2(b)]. In Figure 1(a) the untranslated DNA regions are shown in uppercase,
and in both Figures 1 and 2 the signal sequences are underlined.

7. CONSTRUCTION OF cDNA EXPRESSION VECTORS 

Celltech expression vectors are based on the plasmid pEE6hCMV (ref. 14). A polylinker for the insertion
of genes to be expressed has been introduced after the major immediate early promoter/enhancer of the
human Cytomegalovirus (hCMV). Marker genes for selection of the plasmid in transfected eukaryotic
cells can be inserted as BamH1 cassettes in the unique BamH1 site of pEE6 hCMV; for instance, the
neo marker to provide pEE6 hCMV neo. It is usual practice to insert the neo and gpt markers prior to
insertion of the gene of interest, whereas the GS marker is inserted last because of the presence of
internal EcoR1 sites in the cassette.

The selectable markers are expressed from the SV40 late promoter which also provides an origin of
replication so that the vectors can be used for expression in the COS cell transient expression system.
The mouse sequences were excised from the M13 based vectors described above as EcoR1 fragments
and cloned into pEE6-hCMV-neo for the heavy chain and into pEE6-hCMV-gpt for the light chain to
yield vectors pJA136 and pJA135 respectively.

8. EXPRESSION OF cDNAS IN COS CELLS

Plasmids pJA135 and pJA136 were co-transfected into COS cells and supernatant from the transient
expression experiment was shown to contain assembled antibody which bound to T-cell enriched
lymphocytes. Metabolic labelling experiments using 35S methionine showed expression and assembly of
heavy and light chains.
9. CONSTRUCTION OF CHIMERIC GENES 

Construction of chimeric genes followed a previously described strategy [Whittle et al (ref. 13)]. A
restriction site near the 3' end of the variable domain sequence is identified and used to attach an
oligonucleotide adapter coding for the remainder of the mouse variable region and a suitable restriction
site for attachment to the constant region of choice.
9.1. LIGHT CHAIN GENE CONSTRUCTION 

The mouse light chain cDNA sequence contains an AvaI site near the 3' end of the variable region
[Fig. 1(a)]. The majority of the sequence of the variable region was isolated as a 396 bp EcoRI-AvaI
fragment. An oligonucleotide adapter was designed to replace the remainder of the 3' region of the
variable region from the AvaI site and to include the 5' residues of the human constant region up to
and including a unique NarI site which had been previously engineered into the constant region.
A HindIII site was introduced to act as a marker for insertion of the linker.

The linker was ligated to the VL fragment and the 413 bp EcoRI-NarI adapted fragment was purified
from the ligation mixture.

The constant region was isolated as an NarI-BamHI fragment from an M13 clone NW361 and was
ligated with the variable region DNA into an EcoRI/BamHI/CIP treated pSP65 vector in a three way
reaction to yield plasmid pJA143. Clones were isolated after transformation into E.coli and the linker
and junction sequences were confirmed by the presence of the HindIII site and by DNA sequencing.
9.2 LIGHT CHAIN GENE CONSTRUCTION - VERSION 2 

The construction of the first chimeric light chain gene produces a fusion of mouse and human amino
acid sequences at the variable-constant region junction. In the case of the OKT3 light chain the amino
acids at the chimera junction are:

Leu-Glu-Ile-Asn-Arg/ - /Thr-Val-Ala-Ala

VARIABLE CONSTANT

This arrangement of sequence introduces a potential site for Asparagine (Asn) linked (N-linked)
glycosylation at the V-C junction. Therefore, a second version of the chimeric light chain
oligonucleotide adapter was designed in which the threonine (Thr), the first amino acid of the human
constant region, was replaced with the equivalent amino acid from the mouse constant region, Alanine
(Ala).
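The potential N-linked glycosylation site arises because Asn-Arg-Thr fits the commonly used consensus sequon Asn-X-Ser/Thr, where X is any residue except proline. A minimal sketch of scanning the two junction versions for that motif is given below; the function name and the one-letter transcriptions of the junction (LEINR/TVAA and LEINR/AVAA) are introduced here for illustration only.

```python
import re

# Illustrative sketch: N-X-S/T sequon scan, with X being any residue except proline.
# A lookahead is used so that overlapping sequons would all be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def n_glycosylation_sites(protein):
    """Return 0-based indices of Asn residues that start an N-X-S/T sequon."""
    return [match.start() for match in SEQUON.finditer(protein.upper())]


JUNCTION_V1 = "LEINRTVAA"   # Leu-Glu-Ile-Asn-Arg / Thr-Val-Ala-Ala
JUNCTION_V2 = "LEINRAVAA"   # Thr replaced by Ala (version 2 adapter)

if __name__ == "__main__":
    print(n_glycosylation_sites(JUNCTION_V1))
    print(n_glycosylation_sites(JUNCTION_V2))
```

The scan flags the Asn in the version 1 junction and nothing in version 2. A sequon only marks a potential site; whether it is actually glycosylated has to be established experimentally.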

An internal HindIII site was not included in this adapter, to differentiate the two chimeric light chain
genes.

The variable region fragment was isolated as a 376 bp EcoRI-AvaI fragment. The oligonucleotide linker
was ligated to NarI cut pNW361 and then the adapted 396 bp constant region was isolated after recutting
the modified pNW361 with EcoRI. The variable region fragment and the modified constant region
fragment were ligated directly into EcoRI/CIP treated pEE6hCMVneo to yield pJA137. Initially all clones
examined had the insert in the incorrect orientation. Therefore, the insert was re-isolated and recloned to
turn the insert round and yield plasmid pJA141. Several clones with the insert in the correct orientation
were obtained and the adapter sequence of one was confirmed by DNA sequencing.
9.3. HEAVY CHAIN GENE CONSTRUCTION 

9.3.1. CHOICE OF HEAVY CHAIN GENE ISOTYPE 
The constant region isotype chosen for the heavy chain was human IgG4.
9.3.2. GENE CONSTRUCTION

The heavy chain cDNA sequence showed a BanI site near the 3' end of the variable region [Fig. 2(a)].
The majority of the sequence of the variable region was isolated as a 426 bp EcoRI/CIP/BanI
fragment. An oligonucleotide adapter was designed to replace the remainder of the 3' region of the
variable region from the BanI site up to and including a unique HindIII site which had been previously
engineered into the first two amino acids of the constant region.

The linker was ligated to the VH fragment and the EcoRI-HindIII adapted fragment was purified from
the ligation mixture. The variable region was ligated to the constant region by cutting pJA91 with
EcoRI and HindIII, removing the intron fragment and replacing it with the VH to yield pJA142. Clones
were isolated after transformation into E.coli JM101 and the linker and junction sequences were
confirmed by DNA sequencing. (N.B. The HindIII site is lost on cloning).

10. CONSTRUCTION OF CHIMERIC EXPRESSION VECTORS 
10.1. neo AND gpt VECTORS

The chimeric light chain (version 1) was removed from pJA143 as an EcoRI fragment and cloned into
EcoRI/CIP treated pEE6hCMVneo expression vector to yield pJA145. Clones with the insert in the
correct orientation were identified by restriction mapping.

The chimeric light chain (version 2) was constructed as described above.

The chimeric heavy chain gene was isolated from pJA142 as a 2.5 Kbp EcoRI/BamH1 fragment and
cloned into the EcoRI/Bcl1/CIP treated vector fragment of a derivative of pEE6hCMVgpt to yield
plasmid pJA144.

10.2. GS SEPARATE VECTORS

GS versions of pJA141 and pJA144 were constructed by replacing the neo and gpt cassettes by a
BamH1/Sal1/CIP treatment of the plasmids, isolation of the vector fragment and ligation to a
GS-containing fragment from the plasmid pR049 to yield the light chain vector pJA179 and the heavy
chain vector pJA180.

10.3. GS SINGLE VECTOR CONSTRUCTION

Single vector constructions containing the cL (chimeric light), cH (chimeric heavy) and GS genes on
one plasmid in the order cL-cH-GS, or cH-cL-GS and with transcription of the genes being head to tail
e.g. cL>cH>GS were constructed. These plasmids were made by treating pJA179 or pJA180 with
BamH1/CIP and ligating in a BglII/HindIII hCMV promoter cassette along with either the
HindIII/BamH1 fragment from pJA141 into pJA180 to give the cH-cL-GS plasmid pJA182 or the
HindIII/BamH1 fragment from pJA144 into pJA179 to give the cL-cH-GS plasmid pJA181.

11. EXPRESSION OF CHIMERIC GENES 
11.1. EXPRESSION IN COS CELLS

The chimeric antibody plasmids pJA145 (cL) and pJA144 (cH) were co-transfected into COS cells and
supernatant from the transient expression experiment was shown to contain assembled antibody
which bound to the HUT 78 human T-cell line. Metabolic labelling experiments using 35S methionine
showed expression and assembly of heavy and light chains. However the light chain mobility seen on
reduced gels suggested that the potential glycosylation site was being glycosylated. Expression in
COS cells in the presence of tunicamycin showed a reduction in size of the light chain to that shown
for control chimeric antibodies and the OKT3 mouse light chain. Therefore pJA141 was constructed
and expressed. In this case the light chain did not show an aberrant mobility or a size shift in the
presence or absence of tunicamycin. This second version of the chimeric light chain, when expressed
in association with chimeric heavy (cH) chain, produced antibody which showed good binding to HUT
78 cells. In both cases antigen binding was equivalent to that of the mouse antibody.

11.2 EXPRESSION IN CHINESE HAMSTER OVARY (CHO) CELLS Stable cell lines have been
prepared from plasmids pJA141/pJA144 and from pJA179/pJA180, pJA181 and pJA182 by
transfection into CHO cells.

12. CDR-GRAFTING

The approach taken was to try to introduce sufficient mouse residues into a human variable region 
framework to generate antigen binding activity comparable to the mouse and chimeric antibodies. 

12.1. VARIABLE REGION ANALYSIS 

From an examination of a small database of structures of antibodies and antigen-antibody complexes
it is clear that only a small number of antibody residues make direct contact with antigen. Other
residues may contribute to antigen binding by positioning the contact residues in favourable
configurations and also by inducing a stable packing of the individual variable domains and stable interaction
of the light and heavy chain variable domains.
The residues chosen for transfer can be identified in a number of ways:
(a) By examination of antibody X-ray crystal structures the antigen binding surface can be
predominantly located on a series of loops, three per domain, which extend from the β-barrel
framework.

(b) By analysis of antibody variable domain sequences regions of hypervariability [termed the
Complementarity Determining Regions (CDRs) by Wu and Kabat (ref. 5)] can be identified. In
most but not all cases these CDRs correspond to, but extend a short way beyond, the loop regions
noted above.

(c) Residues not identified by (a) and (b) may contribute to antigen binding directly or indirectly by
affecting antigen binding site topology, or by inducing a stable packing of the individual variable
domains and stabilising the inter-variable domain interaction. These residues may be identified
either by superimposing the sequences for a given antibody on a known structure and looking at
key residues for their contribution, or by sequence alignment analysis and noting "idiosyncratic"
residues followed by examination of their structural location and likely effects.
12.1.1. LIGHT CHAIN 

Figure 3 shows an alignment of sequences for the human framework region REI and the OKT3
light variable region. The structural loops (LOOP) and CDRs (KABAT) believed to correspond to the
antigen binding region are marked. Also marked are a number of other residues which may also
contribute to antigen binding as described in 12.1(c). Above the sequence in Figure 3 the residue
type indicates the spatial location of each residue side chain, derived by examination of resolved
structures from X-ray crystallography analysis. The key to this residue type designation is as
follows:

N - near to CDR (From X-ray Structures)
P - Packing B - Buried Non-Packing
S - Surface E - Exposed
I - Interface * - Interface
- Packing/Part Exposed

? - Non-CDR Residues which may require to be left as Mouse sequence. Residues underlined in
Figure 3 are amino acids. REI was chosen as the human framework because the light chain is a
kappa chain and the kappa variable regions show higher homology with the mouse sequences than
a lambda light variable region, e.g. KOL (see below). REI was chosen in preference to another
kappa light chain because the X-ray structure of the light chain has been determined so that a
structural examination of individual residues could be made.
12.1.2. HEAVY CHAIN

Similarly Figure 4 shows an alignment of sequences for the human framework region KOL and the
OKT3 heavy variable region. The structural loops and CDRs believed to correspond to the antigen
binding region are marked. Also marked are a number of other residues which may also contribute
to antigen binding as described in 12.1(c). The residue type key and other indicators used in
Figure 4 are the same as those used in Figure 3. KOL was chosen as the heavy chain framework
because the X-ray structure has been determined to a better resolution than, for example, NEWM
and also the sequence alignment of OKT3 heavy variable region showed a slightly better homology
to KOL than to NEWM.

12.2. DESIGN OF VARIABLE GENES 

The variable region domains were designed with mouse variable region optimal codon usage
[Grantham and Perrin (ref. 15)] and used the B72.3 signal sequences [Whittle et al (ref. 13)]. The
sequences were designed to be attached to the constant region in the same way as for the chimeric
genes described above. Some constructs contained the "Kozak consensus sequence" [Kozak (ref.
16)] directly linked to the 5' of the signal sequence in the gene. This sequence motif is believed to
have a beneficial role in translation initiation in eukaryotes.
12.3. GENE CONSTRUCTION

To build the variable regions, various strategies are available. The sequence may be assembled by
using oligonucleotides in a manner similar to Jones et al (ref. 17) or by simultaneously replacing all of
the CDRs or loop regions by oligonucleotide directed site specific mutagenesis in a manner similar to
Verhoeyen et al (ref. 2). Both strategies were used and a list of constructions is set out in Tables 1
and 2 and Figures 4 and 5. It was noted in several cases that the mutagenesis approach led to
deletions and rearrangements in the gene being remodelled, while the success of the assembly
approach was very sensitive to the quality of the oligonucleotides.
13. CONSTRUCTION OF EXPRESSION VECTORS 

Genes were isolated from M13 or SP65 based intermediate vectors and cloned into pEE6hCMVneo for
the light chains and pEE6hCMVgpt for the heavy chains in a manner similar to that for the chimeric
genes as described above.

TABLE 1 CDR-GRAFTED GENE CONSTRUCTS

CODE    MOUSE SEQUENCE CONTENT                    METHOD OF CONSTRUCTION     KOZAK SEQUENCE

LIGHT CHAIN    ALL HUMAN FRAMEWORK RE1

121     26-32, 50-56, 91-96 inclusive             SDM and gene assembly      +     n.d.
121A    26-32, 50-56, 91-96 inclusive             Partial gene assembly      n.d.  +
        + 1, 3, 46, 47
121B    26-32, 50-56, 91-96 inclusive             Partial gene assembly      n.d.  +
        + 46, 47
221     24-34, 50-56, 91-96 inclusive             Partial gene assembly      +     +
221A    24-34, 50-56, 91-96 inclusive             Partial gene assembly      +     +
        + 1, 3, 46, 47
221B    24-34, 50-56, 91-96 inclusive             Partial gene assembly      +     +
        + 1, 3
221C    24-34, 50-56, 91-96 inclusive             Partial gene assembly      +     +
        + 46, 47

HEAVY CHAIN    ALL HUMAN FRAMEWORK KOL

121     26-32, 50-56, 95-100B inclusive           Gene assembly              n.d.  +
131     26-32, 50-58, 95-100B inclusive           Gene assembly              n.d.  +
141     26-32, 50-65, 95-100B inclusive           Partial gene assembly      +     n.d.
321     26-35, 50-56, 95-100B inclusive           Partial gene assembly      +     n.d.
331     26-35, 50-58, 95-100B inclusive           Partial gene assembly      +
                                                  Gene assembly
341     26-35, 50-65, 95-100B inclusive           SDM                        +
                                                  Partial gene assembly
341A    26-35, 50-65, 95-100B inclusive           Gene assembly              n.d.
        + 6, 23, 24, 48, 49, 71, 73, 76,
        78, 88, 91 (+63 = human)
341B    26-35, 50-65, 95-100B inclusive           Gene assembly              n.d.
        + 48, 49, 71, 73, 76, 78, 88, 91
        (+63 = human)

KEY

n.d.                    not done
SDM                     Site directed mutagenesis
Gene assembly           Variable region assembled entirely from oligonucleotides
Partial gene assembly   Variable region assembled by combination of restriction
                        fragments either from other genes originally created by SDM
                        and gene assembly or by oligonucleotide assembly of part of
                        the variable region and reconstruction with restriction
                        fragments from other genes originally created by SDM and gene
                        assembly



14. EXPRESSION OF CDR-GRAFTED GENES

14.1. PRODUCTION OF ANTIBODY CONSISTING OF GRAFTED LIGHT (gL) CHAINS WITH MOUSE
HEAVY (mH) OR CHIMERIC HEAVY (cH) CHAINS

All gL chains, in association with mH or cH, produced reasonable amounts of antibody. Insertion of the
Kozak consensus sequence at a position 5' to the ATG (kgL constructs), however, led to a 2-5 fold
improvement in net expression. Over an extended series of experiments expression levels were raised
from approximately 200 ng/ml to approximately 500 ng/ml for kgL/cH or kgL/mH combinations.
When direct binding to antigen on HUT 78 cells was measured, a construct designed to include
mouse sequence based on loop length (gL121) did not lead to active antibody in association with mH
or cH. A construct designed to include mouse sequence based on Kabat CDRs (gL221) demonstrated
some weak binding in association with mH or cH. However, when framework residues 1, 3, 46 and 47
were changed from the human to the murine OKT3 equivalents based on the arguments outlined in
Section 12.1, antigen binding was demonstrated when both of the new constructs, which were termed
121A and 221A, were co-expressed with cH. When the effects of these residues were examined in
more detail, it appeared that residues 1 and 3 are not major contributing residues, as the product of the
gL221B gene shows little detectable binding activity in association with cH. The light chain product of
gL221C, in which mouse sequences are present at 46 and 47, shows good binding activity in
association with cH.

14.2 PRODUCTION OF ANTIBODY CONSISTING OF GRAFTED HEAVY (gH) CHAINS WITH MOUSE 
LIGHT (mL) OR CHIMERIC LIGHT (cL) CHAINS 

Expression of the gH genes proved to be more difficult to achieve than for gL. First, inclusion of the
Kozak sequence appeared to have no marked effect on expression of gH genes. Expression appears
to be slightly improved but not to the same degree as seen for the grafted light chain.
Also, it proved difficult to demonstrate production of expected quantities of material when the loop
choice (amino acids 26-32) for CDR1 is used, e.g. gH121, 131, 141, and no conclusions can be drawn
about these constructs.

Moreover, co-expression of the gH341 gene with cL or mL has been variable and has tended to
produce lower amounts of antibody than the cH/cL or mH/mL combinations. The alterations to gH341
to produce gH341A and gH341B led to improved levels of expression.

This may be due either to a general increase in the fraction of mouse sequence in the variable region,
or to the alteration at position 63 where the residue is returned to the human amino acid Valine (Val)
from Phenylalanine (Phe) to avoid possible internal packing problems with the rest of the human
framework. This arrangement also occurs in gH331 and gH321.

When gH321 or gH331 were expressed in association with cL, antibody was produced but antibody 
binding activity was not detected. 

When the more conservative gH341 gene was used, antigen binding could be detected in association
with cL or mL, but the activity was only marginally above the background level. When further mouse
residues were substituted based on the arguments in 12.1, antigen binding could be clearly
demonstrated for the antibody produced when kgH341A and kgH341B were expressed in association
with cL.

14.3 PRODUCTION OF FULLY CDR-GRAFTED ANTIBODY

The kgL221A gene was co-expressed with kgH341, kgH341A or kgH341B. For the combination
kgL221A/kgH341 very little material was produced in a normal COS cell expression.
For the combinations kgL221A/kgH341A or kgL221A/kgH341B amounts of antibody similar to gL/cH
were produced.

In several experiments no antigen binding activity could be detected with kgL221A/gH341 or
kgL221A/kgH341 combinations, although expression levels were very low.

Antigen binding was detected when kgL221A/kgH341A or kgL221A/kgH341B combinations were
expressed. In the case of the antibody produced from the kgL221A/kgH341A combination the antigen
binding was very similar to that of the chimeric antibody.
An analysis of the above results is given below.
15. DISCUSSION OF CDR-GRAFTING RESULTS

In the design of the fully humanised antibody the aim was to transfer the minimum number of mouse 
amino acids that would confer antigen binding onto a human antibody framework. 
15.1. LIGHT CHAIN 

15.1.1. EXTENT OF THE CDRs 

For the light chain the regions defining the loops known from structural studies of other antibodies
to contain the antigen contacting residues, and those hypervariable sequences defined by Kabat et
al (refs. 4 and 5) as Complementarity Determining Regions (CDRs) are equivalent for CDR2. For
CDR1 the hypervariable region extends from residues 24-34 inclusive while the structural loop
extends from 26-32 inclusive. In the case of OKT3 there is only one amino acid difference between
the two options, at amino acid 24, where the mouse sequence is a serine and the human
framework RE1 has glutamine. For CDR3 the loop extends from residues 91-96 inclusive while the
Kabat hypervariability extends from residues 89-97 inclusive. For OKT3 amino acids 89, 90 and 97
are the same between OKT3 and RE1 (Fig. 3). When constructs based on the loop choice for
CDR1 (gL121) and the Kabat choice (gL221) were made and co-expressed with mH or cH, no
evidence for antigen binding activity could be found for gL121, but trace activity could be detected
for gL221, suggesting that a single extra mouse residue in the grafted variable region could
have some detectable effect. Both gene constructs were reasonably well expressed in the transient
expression system.
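
The difference between the two CDR definitions for the light chain can be made concrete by listing the
positions covered by the Kabat choice but not by the structural loop choice. The following Python sketch
is illustrative only. It uses the residue ranges quoted above; the variable and function names are
hypothetical, and positions are treated as plain integers, which ignores any lettered insertion
positions in the numbering scheme.

```python
# Inclusive residue ranges quoted in the text for the light chain.
KABAT_CDRS = {"CDR1": (24, 34), "CDR2": (50, 56), "CDR3": (89, 97)}
LOOP_CDRS = {"CDR1": (26, 32), "CDR2": (50, 56), "CDR3": (91, 96)}


def positions(bounds):
    """Expand an inclusive (start, end) range into a set of positions."""
    start, end = bounds
    return set(range(start, end + 1))


def kabat_only(kabat, loops):
    """For each CDR, list positions in the Kabat range but outside the loop range."""
    return {
        name: sorted(positions(kabat[name]) - positions(loops[name]))
        for name in kabat
    }


extra = kabat_only(KABAT_CDRS, LOOP_CDRS)
for name, extra_positions in extra.items():
    print(name, extra_positions)
```

Only those listed positions at which the mouse and human sequences actually differ change the grafted
product; for OKT3 and RE1 the text identifies position 24 as the single such difference.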
15.1.2. FRAMEWORK RESIDUES 

The remaining framework residues were then further examined, in particular amino acids known
from X-ray analysis of other antibodies to be close to the CDRs and also those amino acids which
in OKT3 showed differences from the consensus framework for the mouse subgroup (subgroup VI)
to which OKT3 shows most homology. Four positions, 1, 3, 46 and 47, were identified and their
possible contribution was examined by substituting the mouse amino acid for the human amino
acid at each position. Therefore gL221A (gL221 + D1Q, Q3V, L46R, L47W, see Figure 3 and
Table 1) was made, cloned in pEE6hCMVneo and co-expressed with cH (pJA144). The resultant
antibody was well expressed and showed good binding activity. When the related genes gL221B
(gL221 + D1Q, Q3V) and gL221C (gL221 + L46R, L47W) were made and similarly tested,
both genes produced antibody when co-expressed with cH, but only the gL221C/cH combination
showed good antigen binding. When the gL121A (gL121 + D1Q, Q3V, L46R, L47W) gene was
made and co-expressed with cH, antibody was produced which also bound to antigen.
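
The construct definitions above use a compact substitution notation, read here as D1Q, Q3V, L46R and
L47W (human residue, position, mouse residue). A minimal Python sketch of how such codes can be
applied to a framework is given below. It is an illustration, not the method used to build the genes:
the framework is reduced to the four positions under discussion, the human residues are inferred from
the left-hand letter of each code, and all names are hypothetical.

```python
import re

# Human (RE1) residues at the four framework positions discussed in the text,
# inferred from the left-hand letter of each substitution code (assumption).
HUMAN_FRAMEWORK = {1: "D", 3: "Q", 46: "L", 47: "L"}

# Constructs as defined in the text: gL221 plus human-to-mouse substitutions.
CONSTRUCTS = {
    "gL221": [],
    "gL221A": ["D1Q", "Q3V", "L46R", "L47W"],
    "gL221B": ["D1Q", "Q3V"],
    "gL221C": ["L46R", "L47W"],
}


def parse_substitution(code):
    """Split a code such as 'L46R' into (old residue, position, new residue)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", code)
    if match is None:
        raise ValueError(f"not a substitution code: {code!r}")
    old, position, new = match.groups()
    return old, int(position), new


def apply_substitutions(framework, codes):
    """Return a copy of the framework with each substitution applied."""
    result = dict(framework)
    for code in codes:
        old, position, new = parse_substitution(code)
        if result.get(position) != old:
            raise ValueError(f"{code}: expected {old} at position {position}")
        result[position] = new
    return result


for name, codes in CONSTRUCTS.items():
    residues = apply_substitutions(HUMAN_FRAMEWORK, codes)
    mouse = sorted(p for p in residues if residues[p] != HUMAN_FRAMEWORK[p])
    print(name, "mouse framework positions:", mouse)
```

Checking the expected human residue before each change guards against a mistyped code being applied
silently to the wrong position.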
15.2. HEAVY CHAIN

15.2.1. EXTENT OF THE CDRs 

For the heavy chain the loop and hypervariability analyses agree only in CDR3. For CDR1 the loop
region extends from residues 26-32 inclusive whereas the Kabat CDR extends from residues 31-35
inclusive. For CDR2 the loop region is from 50-58 inclusive while the hypervariable region covers
amino acids 50-65 inclusive. Therefore humanised heavy chains were constructed using the
framework from antibody KOL and with various combinations of these CDR choices, including a
shorter choice for CDR2 of 50-56 inclusive as there was some uncertainty as to the definition of the
end point for the CDR2 loop around residues 56 to 58. The genes were co-expressed with mL or
cL initially. In the case of the gH genes with loop choices for CDR1, e.g. gH121, gH131, gH141, very
little antibody was produced in the culture supernatants. As no free light chain was detected it was
presumed that the antibody was being made and assembled inside the cell but that the heavy
chain was aberrant in some way, possibly incorrectly folded, and therefore the antibody was being
degraded internally. In some experiments trace amounts of antibody could be detected in 35S
labelling studies.

As no net antibody was produced, analysis of these constructs was not pursued further.
When, however, a combination of the loop choice and the Kabat choice for CDR1 was tested
(mouse amino acids 26-35 inclusive), in which residues 31 (Ser to Arg), 33 (Ala to Thr) and 35
(Tyr to His) were changed from the human residues to the mouse residues, and compared to the
first series, antibody was produced for gH321, kgH331 and kgH341 when co-expressed with cL.
Expression was generally low and could not be markedly improved by the insertion of the Kozak
consensus sequence 5' to the ATG of the signal sequence of the gene, as distinct from the case of
the gL genes where such insertion led to a 2-5 fold increase in net antibody production. However,
only in the case of gH341/mL or kgH341/cL could marginal antigen binding activity be
demonstrated. When the kgH341 gene was co-expressed with kgL221A, the net yield of antibody was
too low to give a signal above the background level in the antigen binding assay.
15.2.2. FRAMEWORK RESIDUES

As in the case of the light chain the heavy chain frameworks were re-examined. Possibly because
of the lower initial homology between the mouse and human heavy variable domains compared to
the light chains, more amino acid positions proved to be of interest. Two genes, kgH341A and
kgH341B, were constructed, with 11 or 8 human residues respectively substituted by mouse
residues compared to gH341, and with the CDR2 residue 63 returned to the human amino acid
potentially to improve domain packing. Both showed antigen binding when combined with cL or
kgL221A, the kgH341A gene with all 11 changes appearing to be the superior choice.
15.3 INTERIM CONCLUSIONS 

It has been demonstrated, therefore, for OKT3 that to transfer antigen binding ability to the humanised
antibody, mouse residues outside the CDR regions defined by the Kabat hypervariability or structural
loop choices are required for both the light and heavy chains. Fewer extra residues are needed for the
light chain, possibly due to the higher initial homology between the mouse and human kappa variable
regions.

Of the changes, seven (1 and 3 from the light chain and 6, 23, 71, 73 and 76 from the heavy chain)
are predicted from a knowledge of other antibody structures to be either partly exposed or on the
antibody surface. It has been shown here that residues 1 and 3 in the light chain are not absolutely
required to be the mouse sequence; and for the heavy chain the gH341B heavy chain in combination
with the 221A light chain generated only weak binding activity. Therefore the presence of the 6, 23
and 24 changes is important to maintain a binding affinity similar to that of the murine antibody. It
was important, therefore, to further study the individual contribution of the other 8 mouse residues of
the kgH341A gene compared to kgH341.
16. FURTHER CDR-GRAFTING EXPERIMENTS 

Additional CDR-grafted heavy chain genes were prepared substantially as described above. With
reference to Table 2 the further heavy chain genes were based upon the gH341 (plasmid pJA178) and
gH341A (plasmid pJA185) with either mouse OKT3 or human KOL residues at 6, 23, 24, 48, 49, 63, 71,
73, 76, 78, 88 and 91, as indicated. The CDR-grafted light chain genes used in these further experiments
were gL221, gL221A, gL221B and gL221C as described above.

TABLE 2

OKT3 HEAVY CHAIN CDR GRAFTS
1. gH341 and derivatives
[Table body not recoverable: residue at each listed framework position for gH341 and its derivatives,
with the corresponding plasmid (JA) numbers.]

OKT3 LIGHT CHAIN CDR GRAFTS
2. gL221 and derivatives
[Table body not recoverable: residue at each listed framework position for RE1, gL221A, gL221B and
gL221C.]

MURINE RESIDUES ARE UNDERLINED

The CDR-grafted heavy and light chain genes were co-expressed in COS cells both with one another
in various combinations and with the corresponding murine and chimeric heavy and light chain genes
substantially as described above. The resultant antibody products were then assayed in binding and
blocking assays with HPB-ALL cells as described above.
The results of the assays for various grafted heavy chains co-expressed with the gL221C light chain are
given in Figures 7 and 8 (for the JA184, JA185, JA197 and JA198 constructs, see Table 2), in Figure 9 (for
the JA183, JA184, JA185 and JA197 constructs), in Figure 10 (for the chimeric, JA185, JA199, JA204,
JA205, JA207, JA208 and JA209 constructs) and in Figure 11 (for the JA183, JA184, JA185, JA198, JA203,
JA205 and JA206 constructs).

The basic grafted product without any human to murine changes in the variable frameworks, i.e. gL221
co-expressed with gH341 (JA178), and also the "fully grafted" product, having most human to murine
changes in the grafted heavy chain framework, i.e. gL221C co-expressed with gH341A (JA185), were
assayed for relative binding affinity in a competition assay against murine OKT3 reference standard, using
HPB-ALL cells. The assay used was as described above in section 3.3. The results obtained are given in
Figure 12 for the basic grafted product and in Figure 13 for the fully grafted product. These results indicate
that the basic grafted product has negligible binding ability as compared with the OKT3 murine reference
standard; whereas the "fully grafted" product has a binding ability very similar to that of the OKT3 murine
reference standard.

The binding and blocking assay results indicate the following:

The JA198 and JA207 constructs appear to have the best binding characteristics and similar binding
abilities, both substantially the same as the chimeric and fully grafted gH341A products. This indicates that
positions 88 and 91 and position 76 are not highly critical for maintaining the OKT3 binding ability; whereas
at least some of positions 6, 23, 24, 48, 49, 71, 73 and 78 are more important.

This is borne out by the finding that the JA209 and JA199 constructs, although of similar binding ability to
one another, are of lower binding ability than the JA198 and JA207 constructs. This indicates the importance of
having mouse residues at positions 71, 73 and 78, which are either completely or partially human in the
JA199 and JA209 constructs respectively.

Moreover, on comparing the results obtained for the JA205 and JA183 constructs it is seen that there is
a decrease in binding going from the JA205 to the JA183 constructs. This indicates the importance of
retaining a mouse residue at position 23, the only position changed between JA205 and JA183.

These and other results lead us to the conclusion that of the 11 mouse framework residues used in the
gH341A (JA185) construct, it is important to retain mouse residues at all of positions 6, 23, 24, 48 and 49,
and possibly for maximum binding affinity at 71, 73 and 78.

Similar experiments were carried out to CDR-graft a number of rodent antibodies including
antibodies having specificity for CD4 (OKT4), ICAM-1 (R6-5), TAG72 (B72.3) and TNF-α (61E71, 101.4,
hTNF1, hTNF2 and hTNF3).

EXAMPLE 2 

CDR-GRAFTING OF A MURINE ANTI-CD4 T CELL RECEPTOR ANTIBODY, OKT4A

Anti OKT4A CDR-grafted heavy and light chain genes were prepared, expressed and tested
substantially as described above in Example 1 for CDR-grafted OKT3. The CDR grafting of OKT4A is described in
detail in Ortho patent application PCT/GB 90 ... of even date herewith entitled "Humanised Antibodies".
The disclosure of this Ortho patent application PCT/GB 90 ... is incorporated herein by reference. A
number of CDR-grafted OKT4 antibodies have been prepared. Presently the CDR-grafted OKT4A of choice
is the combination of the grafted light chain LCDR2 and the grafted heavy chain HCDR10.

THE LIGHT CHAIN

The human acceptor framework used for the grafted light chains was RE1. The preferred LCDR2 light
chain has human to mouse changes at positions 33, 34, 38, 49 and 89 in addition to the structural loop
CDRs. Of these changed positions, positions 33, 34 and 89 fall within the preferred extended CDRs of the
present invention (positions 33 and 34 in CDR1 and position 89 in CDR3).

The human to murine changes at positions 38 and 49 correspond to positions at which the amino acid
residues are preferably donor murine amino acid residues in accordance with the present invention.
A comparison of the amino acid sequences of the donor murine light chain variable domain and the RE1
human acceptor light chain variable domain further reveals that the murine and human residues are identical
at all of positions 46, 48 and 71 and at all of positions 2, 4, 6, 35, 36, 44, 47, 62, 64-69, 85, 87, 98, 99, 101
and 102. However the amino acid residue at position 58 in LCDR2 is the human RE1 framework residue, not
the mouse OKT4 residue as would be preferred in accordance with the present invention.

THE HEAVY CHAIN

The human acceptor framework used for the grafted heavy chains was KOL.
The preferred CDR graft HCDR10 heavy chain has human to mouse changes at positions 24, 35, 57, 58,
60, 88 and 91 in addition to the structural loop CDRs.

Of these positions, position 35 (CDR1) and positions 57, 58 and 60 (CDR2) fall within the preferred
extended CDRs of the present invention. Also the human to mouse change at position 24 corresponds to a
position at which the amino acid residue is a donor murine residue in accordance with the present invention.
Moreover, the human to mouse changes at positions 88 and 91 correspond to positions at which the amino
acid residues are optionally donor murine residues.

Moreover, a comparison of the murine OKT4A and human KOL heavy chain variable amino acid sequences
reveals that the murine and human residues are identical at all of positions 23, 49, 71, 73 and 78 and at all
of positions 2, 4, 6, 25, 36, 37, 39, 47, 48, 93, 94, 103, 104, 106 and 107.

Thus the OKT4A CDR-grafted heavy chain HCDR10 corresponds to a particularly preferred embodiment
according to the present invention.



EXAMPLE 3 

CDR-GRAFTING OF AN ANTI-MUCIN SPECIFIC MURINE ANTIBODY, B72.3

The cloning of the genes coding for the anti-mucin specific murine monoclonal antibody B72.3 and the
preparation of B72.3 mouse-human chimeric antibodies has been described previously (ref. 13 and WO
89/01783). CDR-grafted versions of B72.3 were prepared as follows.
(a) B72.3 Light Chain

CDR-grafting of this light chain was accomplished by direct transfer of the murine CDRs into the
framework of the human light chain RE1. The regions transferred were:

CDR Number    Residues

1             24-34
2             50-56
3             90-96

The activity of the resulting grafted light chain was assessed by co-expression in COS cells of genes for
the combinations:
B72.3 cH/B72.3 cL
and B72.3 cH/B72.3 gL

Supernatants were assayed for antibody concentration and for the ability to bind to microtitre plates
coated with mucin. The results obtained indicated that, in combination with the B72.3 cH chain, B72.3 cL
and B72.3 gL had similar binding properties.
Comparison of the murine B72.3 and REI light chain amino acid sequences reveals that the residues
are identical at positions 46, 58 and 71 but are different at position 48. Thus changing the human residue to
the donor mouse residue at position 48 may further improve the binding characteristics of the CDR-grafted
light chain (B72.3 gL) in accordance with the present invention.
(b) B72.3 heavy chain 
i. Choice of framework 

At the outset it was necessary to make a choice of human framework. Simply put, the question was as 
follows: Was it necessary to use the framework regions from an antibody whose crystal structure was 
known or could the choice be made on some other criteria? 

For B72.3 heavy chain, it was reasoned that, while knowledge of structure was important, transfer of 
the CDRs from mouse to human frameworks might be facilitated if the overall homology between the 
donor and receptor frameworks was maximised. Comparison of the B72.3 heavy chain sequence with 
those in Kabat (ref. 4) for human heavy chains showed clearly that B72.3 had poor homology for KOL
and NEWM (for which crystal structures are available) but was very homologous to the heavy chain 
for EU. 

On this basis, EU was chosen for the CDR-grafting and the following residues transferred as CDRs.

CDR Number    Residues

1             27-36
2             50-63
3             93-102

Also it was noticed that the FR4 region of EU was unlike that of any other human (or mouse) antibody.
Consequently, in the grafted heavy chain genes this was also changed to produce a "consensus"
human sequence. (Preliminary experiments showed that grafted heavy chain genes containing the EU
FR4 sequence expressed very poorly in transient expression systems.)
ii. Results with grafted heavy chain genes 

Expression of grafted heavy chain genes containing all human framework regions with either gL or cL
genes produced a grafted antibody with little ability to bind to mucin. The grafted antibody had about
1% the activity of the chimeric antibody. In these experiments, however, it was noted that the activity
of the grafted antibody could be increased to approximately 10% of B72.3 by exposure to pHs of 2-3.5.
This observation provided a clue as to how the activity of the grafted antibody could be improved
without acid treatment. It was postulated that acid exposure brought about the protonation of an acidic
residue (pKa of aspartic acid = 3.86 and of glutamic acid = 4.25) which in turn caused a change in
structure of the CDR loops, or allowed better access of antigen.
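
The protonation argument can be checked numerically with the Henderson-Hasselbalch relation. The
Python sketch below is illustrative only. It uses the pKa values quoted above, which may differ from
the effective pKa of a residue within a folded antibody, and the comparison value of pH 7.4 is an
assumption introduced here for contrast rather than a condition stated in the text.

```python
# pKa values quoted in the text for the acidic side chains.
PKA = {"aspartic acid": 3.86, "glutamic acid": 4.25}

# pH 2 and 3.5 bracket the acid treatment described; 7.4 is an assumed
# near-neutral comparison point (not taken from the text).
PH_VALUES = (2.0, 3.5, 7.4)


def fraction_protonated(ph, pka):
    """Henderson-Hasselbalch: fraction of an acidic group in the protonated form."""
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


for name, pka in PKA.items():
    for ph in PH_VALUES:
        fraction = fraction_protonated(ph, pka)
        print(f"{name} at pH {ph}: {fraction:.3f} protonated")
```

Under these assumptions a glutamic acid side chain is more than 80% protonated throughout the pH 2-3.5
range and almost entirely deprotonated near neutral pH, which is consistent with the postulate that acid
exposure neutralises an acidic residue.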

From comparison of the sequences of B72.3 (ref. 13) and EU (refs. 4 and 5), it was clear that, in going
from the mouse to human frameworks, only two positions had been changed in such a way that acidic
residues had been introduced. These positions are at residues 73 and 81, where K to E and Q to E
changes had been made, respectively.

Which of these positions might be important was determined by examining the crystal structure of the 
KOL antibody. In KOL heavy chain, position 81 is far removed from either of the CDR loops. 
Position 73, however, is close to both CDRs 1 and 3 of the heavy chain and, in this position it was 
possible to envisage that a K to E change in this region could have a detrimental effect on antigen 
binding. 

iii. Framework changes in B72.3 gH gene

On the basis of the above analysis, E73 was mutated to a lysine (K). It was found that this change had 
a dramatic effect on the ability of the grafted Ab to bind to mucin. Further the ability of the grafted 
B72.3 produced by the mutated gH/gL combination to bind to mucin was similar to that of the B72.3 
chimeric antibody. 

iv. Other framework changes 

In the course of the above experiments, other changes were made in the heavy chain framework 
regions. Within the accuracy of the assays used, none of the changes, either alone or together, 
appeared beneficial. 

v. Other 

All assays used measured the ability of the grafted Ab to bind to mucin and, as a whole, indicated
that the single framework change at position 73 is sufficient to generate an antibody with similar 
binding properties to B72.3. 

Comparison of the B72.3 murine and EU heavy chain sequences reveals that the mouse and human 
residues are identical at positions 23, 24, 71 and 78. 

Thus the mutated CDR-grafted B72.3 heavy chain corresponds to a preferred embodiment of the 
present invention. 

EXAMPLE 4 

CDR-GRAFTING OF A MURINE ANTI-ICAM-1 MONOCLONAL ANTIBODY

A murine antibody, R6-5-D6 (EP 0314863) having specificity for Intercellular Adhesion Molecule 1 
(ICAM-1) was CDR-grafted substantially as described above in previous examples. This work is described in 
greater detail in co-pending application, British Patent Application No. 9009549.8, the disclosure of which is 
incorporated herein by reference. 

The human EU framework was used as the acceptor framework for both heavy and light chains. The
CDR-grafted antibody currently of choice is provided by co-expression of grafted light chain gL221A and grafted
heavy chain gH341D which has a binding affinity for ICAM-1 of about 75% of that of the corresponding
mouse-human chimeric antibody.

LIGHT CHAIN

gL221A has murine CDRs at positions 24-34 (CDR1), 50-56 (CDR2) and 89-97 (CDR3). In addition
several framework residues are also the murine amino acid. These residues were chosen after consideration
of the possible contribution of these residues to domain packing and stability of the conformation of the
antigen binding region. The residues which have been retained as mouse are at positions 2, 3, 48 (?), 60,
84, 85 and 87. Comparison of the murine anti-ICAM-1 and human EU light chain amino acid sequences
reveals that the murine and human residues are identical at positions 46, 58 and 71.

HEAVY CHAIN

gH341D has murine CDRs at positions 26-35 (CDR1), 50-56 (CDR2) and 94-100B (CDR3). In addition
murine residues were used in gH341D at positions 24, 48, 69, 71, 73, 80, 88 and 91. Comparison of the
murine anti-ICAM-1 and human EU heavy chain amino acid sequences reveals that the murine and human
residues are identical at positions 23, 49 and 78.

EXAMPLE 5

CDR-Grafting of murine anti-TNFα antibodies

A number of murine anti-TNFα monoclonal antibodies were CDR-grafted substantially as described
above in previous examples. These antibodies include the murine monoclonal antibodies designated
61E71, hTNF1, hTNF3 and 101.4. A brief summary of the CDR-grafting of each of these antibodies is given
below.

61E71

A similar analysis as described above (Example 1, Section 12.1.) was done for 61E71 and for the heavy
chain 10 residues were identified at 23, 24, 48, 49, 68, 69, 71, 73, 75 and 88 as residues to potentially
retain as murine. The human frameworks chosen for CDR-grafting of this antibody, and the hTNF3 and
101.4 antibodies, were RE1 for the light chain and KOL for the heavy chain.

Three genes were built, the first of which contained 23, 24, 48, 49, 71 and 73 [gH341(6)] as murine
residues. The second gene also had 75 and 88 as murine residues [gH341(8)] while the third gene
additionally had 68, 69, 75 and 88 as murine residues [gH341(10)]. Each was co-expressed with gL221, the
minimum grafted light chain (CDRs only). The gL221/gH341(6) and gL221/gH341(8) antibodies both bound
as well to TNF as murine 61E71. The gL221/gH341(10) antibody did not express and this combination was
not taken further.

Subsequently the gL221/gH341(6) antibody was assessed in an L929 cell competition assay in which the
antibody competes against the TNF receptor on L929 cells for binding to TNF in solution. In this assay the
gL221/gH341(6) antibody was approximately 10% as active as murine 61E71.

hTNF1

hTNF1 is a monoclonal antibody which recognises an epitope on human TNF-α. The EU human
framework was used for CDR-grafting of both the heavy and light variable domains.

Heavy Chain

In the CDR-grafted heavy chain (ghTNF1) mouse CDRs were used at positions 26-35 (CDR1), 50-65
(CDR2) and 95-102 (CDR3). Mouse residues were also used in the frameworks at positions 48, 67, 69, 71,
73, 76, 89, 91, 94 and 108. Comparison of the hTNF1 mouse and EU human heavy chain residues reveals
that these are identical at positions 23, 24, 29 and 78.

Light Chain

In the CDR-grafted light chain (gLhTNF1) mouse CDRs were used at positions 24-34 (CDR1), 50-56
(CDR2) and 89-97 (CDR3). In addition mouse residues were used in the frameworks at positions 3, 42, 48,
49, 83, 106 and 108. Comparison of the hTNF1 mouse and EU human light chain residues reveals that
these are identical at positions 46, 58 and 71.

The grafted hTNF1 heavy chain was co-expressed with the chimeric light chain and the binding ability
of the product compared with that of the chimeric light chain/chimeric heavy chain product in a TNF binding
assay. The grafted heavy chain product appeared to have binding ability for TNF slightly better than the
fully chimeric product.

Similarly, a grafted heavy chain/grafted light chain product was co-expressed and compared with the
fully chimeric product and found to have closely similar binding properties to the latter product.

hTNF3 

hTNF3 recognises an epitope on human TNF-α. The sequence of hTNF3 shows only 21 differences
compared to 61E71 in the light and heavy chain variable regions, 10 in the light chain (2 in the CDRs at
positions 50, 98 and 8 in the framework at 1, 19, 40, 45, 46, 76, 103 and 106) and 11 in the heavy chain (3
in the CDR regions at positions 52, 60 and 95 and 8 in the framework at 1, 10, 38, 40, 67, 73, 87 and 105).

The light and heavy chains of the 61E71 and hTNF3 chimeric antibodies can be exchanged without loss of
activity in the direct binding assay. However 61E71 is an order of magnitude less able to compete with the
TNF receptor on L929 cells for TNF-α compared to hTNF3. Based on the 61E71 CDR grafting data gL221
and gH341(+23, 24, 48, 49, 71 and 73 as mouse) genes have been built for hTNF3 and tested and the
resultant grafted antibody binds well to TNF-α, but competes very poorly in the L929 assay. It is possible
that in this case also the framework residues identified for the OKT3 programme may improve the competitive
binding ability of this antibody.
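
The difference counts quoted for hTNF3 against 61E71 can be tallied directly from the positions listed in the text. The sketch below only re-adds those stated positions; the variable and function names are illustrative, and no sequence data beyond what the paragraph gives is assumed.

```python
# Positions quoted in the text for hTNF3 versus 61E71 (Kabat-style numbers).
light = {"cdr": [50, 98], "framework": [1, 19, 40, 45, 46, 76, 103, 106]}
heavy = {"cdr": [52, 60, 95], "framework": [1, 10, 38, 40, 67, 73, 87, 105]}


def count_differences(chain):
    """Total differing positions in one chain (CDR plus framework)."""
    return len(chain["cdr"]) + len(chain["framework"])


light_total = count_differences(light)
heavy_total = count_differences(heavy)
total = light_total + heavy_total
print(light_total, heavy_total, total)
```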

101.4 

101.4 is a further murine monoclonal antibody able to recognise human TNF-α. The heavy chain of this
antibody shows good homology to KOL and so the CDR-grafting has been based on REI for the light chain
and KOL for the heavy chain. Several grafted heavy chain genes have been constructed with conservative
choices for the CDRs (gH341) and which have one or a small number of non-CDR residues at positions 73,
78 or 77-79 inclusive, as the mouse amino acids. These have been co-expressed with cL or gL221. In all
cases binding to TNF equivalent to the chimeric antibody is seen and when co-expressed with cL the
resultant antibodies are able to compete well in the L929 assay. However, with gL221 the resultant
antibodies are at least an order of magnitude less able to compete for TNF against the TNF receptor on
L929 cells.
Mouse residues at other positions in the heavy chain, for example, at 23 and 24 together or at 76, have
been demonstrated to provide no improvement to the competitive ability of the grafted antibody in the L929
assay.

A number of other antibodies including antibodies having specificity for interleukins e.g. IL1 and cancer
markers such as carcinoembryonic antigen (CEA) e.g. the monoclonal antibody A5B7 (ref. 21), have been
successfully CDR-grafted according to the present invention.
It will be appreciated that the foregoing examples are given by way of illustration only and are not intended
to limit the scope of the claimed invention. Changes and modifications may be made to the methods
described whilst still falling within the spirit and scope of the invention.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: 

(A) NAME: CELLTECH LIMITED 

(B) STREET: 216 BATH ROAD

(C) CITY: SLOUGH 

(D) STATE: BERKSHIRE 

(E) COUNTRY: UNITED KINGDOM 

(F) POSTAL CODE (ZIP): SL1 4EN

(G) TELEPHONE: 0753 534655 

(H) TELEFAX: 0753 536632 

(I) TELEX: 848473 

(ii) TITLE OF INVENTION: HUMANISED ANTIBODIES 
(iii) NUMBER OF SEQUENCES: 33 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TCCAGATGTT AACTGCTCAC 
20 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

CAGGGGCCAG TGGATGGATA GAC 
23 
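
Each entry states a length in field (A) and repeats a running count beneath the sequence, so the two can be cross-checked by stripping the spacing and counting characters. A minimal sketch of that check for SEQ ID NO: 1 and SEQ ID NO: 2 follows; the function name is illustrative.

```python
def sequence_length(listing_text):
    """Count residues in a listing row, ignoring the grouping whitespace."""
    return len("".join(listing_text.split()))


SEQ_ID_1 = "TCCAGATGTT AACTGCTCAC"
SEQ_ID_2 = "CAGGGGCCAG TGGATGGATA GAC"

len_1 = sequence_length(SEQ_ID_1)
len_2 = sequence_length(SEQ_ID_2)
print(len_1, len_2)  # should agree with the stated 20 and 23 base pairs
```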

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 



(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE: internal



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Leu Glu Ile Asn Arg Thr Val Ala Ala
1 5
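
The protein entries use three-letter residue codes. For comparison against one-letter sequences it can help to convert them; the sketch below does this for SEQ ID NO: 3, reading the third residue as Ile. The mapping covers only the twenty standard amino acids, and the function name is illustrative.

```python
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter_row):
    """Convert a whitespace-separated row of three-letter codes to one-letter codes."""
    # A KeyError here flags a token that is not a standard residue code.
    return "".join(THREE_TO_ONE[token] for token in three_letter_row.split())


seq_id_3 = to_one_letter("Leu Glu Ile Asn Arg Thr Val Ala Ala")
print(seq_id_3, len(seq_id_3))
```

Because unknown tokens raise an error, the same function can be used to spot residue codes that were corrupted in transcription.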

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 943 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GAATTCCCAA AGACAAAATG GATTTTCAAG TGCAGATTTT CAGCTTCCTG CTAATCAGTG 
60 

CCTCAGTCAT AATATCCAGA GGACAAATTG TTCTCACCCA GTCTCCAGCA ATCATGTCTG 
120 

CATCTCCAGG GGAGAAGGTC ACCATGACCT GCAGTGCCAG CTCAAGTGTA AGTTACATGA 
180 

ACTGGTACCA GCAGAAGTCA GGCACCTCCC CCAAAAGATG GATTTATGAC ACATCCAAAC 
240 

TGGCTTCTGG AGTCCCTGCT CACTTCAGGG GCAGTGGGTC TGGGACCTCT TACTCTCTCA
300 

CAATCAGCGG CATGGAGGCT GAAGATGCTG CCACTTATTA CTGCCAGCAG TGGAGTAGTA 
360 

ACCCATTCAC GTTCGGCTCG GGGACAAAGT TGGAAATAAA CCGGGCTGAT ACTGCACCAA 
420 

CTGTATCCAT CTTCCCACCA TCCAGTGAGC AGTTAACATC TGGAGGTGCC TCAGTCGTGT
480 

GCTTCTTGAA CAACTTCTAC CCCAAAGACA TCAATGTCAA GTGGAAGATT GATGGCAGTG 
540 

AACGACAAAA TGGCGTCCTG AACAGTTGGA CTGATCAGGA CAGCAAAGAC AGCACCTACA 
600 

GCATGAGCAG CACCCTCACG TTGACCAAGG ACGAGTATGA ACGACATAAC AGCTATACCT 
660 

GTGAGGCCAC TCACAAGACA TCAACTTCAC CCATTGTCAA GAGCTTCAAC AGGAATGAGT 
720 

GTTAGAGACA AAGGTCCTGA GACGCCACCA CCAGCTCCCA GCTCCATCCT ATCTTCCCTT 
780 
CTAAGGTCTT GGAGGCTTCC CCACAAGCGC TTACCACTGT TGCGGTGCTC TAAACCTCCT 840
CCCACCTCCT TCTCCTCCTC CTCCCTTTCC TTGGCTTTTA TCATGCTAAT ATTTGCAGAA 900
AATATTCAAT AAAGTCAGTC TTTGCCTTGA AAAAAAAAAA AAA 943

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Asp Phe Val Ile Phe Ser Phe Leu Leu Ile Ser Ala Ser Val Ile
1 5 10 15

Ile Ser Arg Gly Gln Ile Val Leu Thr Gln Ser Pro Ala Ile Met Ser
20 25 30

Ala Ser Pro Gly Glu Lys Val Thr Met Thr Cys Ser Ala Ser Ser Ser
35 40 45

Val Ser Tyr Met Asn Trp Tyr Gln Gln Lys Ser Gly Thr Ser Pro Lys
50 55 60

Arg Trp Ile Tyr Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ala His
65 70 75 80

Phe Arg Gly Ser Gly Ser Gly Thr Ser Tyr Ser Leu Thr Ile Ser Gly
85 90 95

Met Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys Gln Gln Trp Ser Ser
100 105 110

Asn Pro Phe Thr Phe Gly Ser Gly Thr Lys Leu Glu Ile Asn Arg Ala
115 120 125

Asp Thr Ala Pro Thr Val Ser Ile Phe Pro Pro Ser Ser Glu Gln Leu
130 135 140

Thr Ser Gly Gly Ala Ser Val Val Cys Phe Leu Asn Asn Phe Tyr Pro
145 150 155 160

Lys Asp Ile Asn Val Lys Trp Lys Ile Asp Gly Ser Glu Arg Gln Asn
165 170 175

Gly Val Leu Asn Ser Trp Thr Asp Gln Asp Ser Lys Asp Ser Thr Tyr
180 185 190

Ser Met Ser Ser Thr Leu Thr Leu Thr Lys Asp Glu Tyr Glu Arg His
195 200 205

Asn Ser Tyr Thr Cys Glu Ala Thr His Lys Thr Ser Thr Ser Pro Ile
210 215 220

Val Lys Ser Phe Asn Arg Asn Glu Cys
225 230

(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1570 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

GAATTCCCCT CTCCACAGAC ACTGAAAACT CTGACTCAAC ATGGAAAGGC ACTGGATCTT 60 

TCTACTCCTG TTGTCAGTAA CTGCAGGTGT CCACTCCCAG GTCCAGCTGC AGCAGTCTGG 120 

GGCTGAACTG GCAAGACCTG GGGCCTCAGT GAAGATGTCC TGCAAGGCTT CTGGCTACAC 180 

CTTTACTAGG TACACGATGC ACTGGGTAAA ACAGAGGCCT GGACAGGGTC TGGAATGGAT 240

TGGATACATT AATCCTAGCC GTGGTTATAC TAATTACAAT CAGAAGTTCA AGGACAAGGC 300 

CACATTGACT ACAGACAAAT CCTCCAGCAC AGCCTACATG CAACTGAGCA GCCTGACATC 360 

TGAGGACTCT GCAGTCTATT ACTGTGCAAG ATATTATGAT GATCATTACT GCCTTGACTA 420

CTGGGGCCAA GGCACCACTC TCACAGTCTC CTCAGCCAAA ACAACAGCCC CATCGGTCTA 480 

TCCACTGGCC CCTGTGTGTG GAGATACAAC TGGCTCCTCG GTGACTCTAG GATGCCTGGT 540 

CAAGGGTTAT TTCCCTGAGC CAGTGACCTT GACCTGGAAC TCTGGATCCC TGTCCAGTGG 600 

TGTGCACACC TTCCCAGCTG TCCTGCAGTC TGACCTCTAC ACCCTCAGCA GCTCAGTGAC 660 

TGTAACCTCG AGCACCTGGC CCAGCCAGTC CATCACCTGC AATGTGGCCC ACCCGGCAAG 720 

CAGCACCAAG GTGGACAAGA AAATTGAGCC CAGAGGGCCC ACAATCAAGC CCTGTCCTCC 780 

ATGCAAATGC CCAGCACCTA ACCTCTTGGG TGGACCATCC GTCTTCATCT TCCCTCCAAA 840 

GATCAAGGAT GTACTCATGA TCTCCCTGAG CCCCATAGTC ACATGTGTGG TGGTGGATGT 900 

GAGCGAGGAT GACCCAGATG TCCAGATCAG CTGGTTTGTG AACAACGTGG AAGTACACAC 960 

AGCTCAGACA CAAACCCATA GAGAGGATTA CAACAGTACT CTCCGGGTGG TCAGTGCCCT 1020

CCCCATCCAG CACCAGGACT GGATGAGTCC CAAGGAGTTC AAATGCAAGG TCAACAACAA 1080 

AGACCTCCCA GCGCCCATCG AGAGAACCAT CTCAAAACCC AAAGGGTCAG TAAGAGCTCC 1140 

ACAGGTATAT GTCTTCCCTC CACCAGAAGA AGAGATGACT AAGAAACAGG TCACTCTGAC 1200 

CTGCATGGTC ACAGACTTCA TGCCTGAAGA CATTTACGTG GAGTGGACCA ACAACGGGAA 1260 

AACAGAGCTA AACTACAAGA ACACTGAACC AGTCCTGGAC TCTGATGGTT CTTACTTCAT 1320 

GTACAGCAAG CTGAGAGTGG AAAAGAAGAA CTGGGTGGAA AGAAATAGCT ACTCCTGTTC 1380 

ACTGGTCCAC GAGGGTCTGC ACAATCACCA CACGACTAAG AGCTTCTCCC GGACTCCGGG 1440

TAAATGAGCT CAGCACCCAC AAAACTCTCA GGTCCAAAGA GACACCCACA CTCATCTCCA 1500 

TGCTTCCCTT GTATAAATAA AGCACCCAGC AATGCCTGGG ACCATGTAAA AAAAAAAAAA 1560 

AAAGGAATTC 1570 

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 468 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Glu Arg His Trp Ile Phe Leu Leu Leu Leu Ser Val Thr Ala Gly
1 5 10 15

Val His Ser Gln Val Gln Leu Gln Gln Ser Gly Ala Glu Leu Ala Arg
20 25 30

Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe
35 40 45

Thr Arg Tyr Thr Met His Trp Val Lys Gln Arg Pro Gly Gln Gly Leu
50 55 60

Glu Trp Ile Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn
65 70 75 80

Gln Lys Phe Lys Asp Lys Ala Thr Leu Thr Thr Asp Lys Ser Ser Ser
85 90 95

Thr Ala Tyr Met Gln Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val
100 105 110

Tyr Tyr Cys Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp
115 120 125

Gly Gln Gly Thr Thr Leu Thr Val Ser Ser Ala Lys Thr Thr Ala Pro
130 135 140

Ser Val Tyr Pro Leu Ala Pro Val Cys Gly Asp Thr Thr Gly Ser Ser
145 150 155 160

Val Thr Leu Gly Cys Leu Val Lys Gly Tyr Phe Pro Glu Pro Val Thr
165 170 175

Leu Thr Trp Asn Ser Gly Ser Leu Ser Ser Gly Val His Thr Phe Pro
180 185 190

Ala Val Leu Gln Ser Asp Leu Tyr Thr Leu Ser Ser Ser Val Thr Val
195 200 205

Thr Ser Ser Thr Trp Pro Ser Gln Ser Ile Thr Cys Asn Val Ala His
210 215 220

Pro Ala Ser Ser Thr Lys Val Asp Lys Lys Ile Glu Pro Arg Gly Pro
225 230 235 240

Thr Ile Lys Pro Cys Pro Pro Cys Lys Cys Pro Ala Pro Asn Leu Leu
245 250 255

Gly Gly Pro Ser Val Phe Ile Phe Pro Pro Lys Ile Lys Asp Val Leu
260 265 270

Met Ile Ser Leu Ser Pro Ile Val Thr Cys Val Val Val Asp Val Ser
275 280 285

Glu Asp Asp Pro Asp Val Gln Ile Ser Trp Phe Val Asn Asn Val Glu
290 295 300

Val His Thr Ala Gln Thr Gln Thr His Arg Glu Asp Tyr Asn Ser Thr
305 310 315 320

Leu Arg Val Val Ser Ala Leu Pro Ile Gln His Gln Asp Trp Met Ser
325 330 335

Gly Lys Glu Phe Lys Cys Lys Val Asn Asn Lys Asp Leu Pro Ala Pro
340 345 350

Ile Glu Arg Thr Ile Ser Lys Pro Lys Gly Ser Val Arg Ala Pro Gln
355 360 365

Val Tyr Val Leu Pro Pro Pro Glu Glu Glu Met Thr Lys Lys Gln Val
370 375 380

Thr Leu Thr Cys Met Val Thr Asp Phe Met Pro Glu Asp Ile Tyr Val
385 390 395 400

Glu Trp Thr Asn Asn Gly Lys Thr Glu Leu Asn Tyr Lys Asn Thr Glu
405 410 415

Pro Val Leu Asp Ser Asp Gly Ser Tyr Phe Met Tyr Ser Lys Leu Arg
420 425 430

Val Glu Lys Lys Asn Trp Val Glu Arg Asn Ser Tyr Ser Cys Ser Val
435 440 445

Val His Glu Gly Leu His Asn His His Thr Thr Lys Ser Phe Ser Arg
450 455 460

Thr Pro Gly Lys
465
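
SEQ ID NO: 7 is the conceptual translation of the open reading frame in SEQ ID NO: 6. As a rough illustration of how the two entries relate, the sketch below translates only the first 60 bases of SEQ ID NO: 6 as printed above, starting at the first ATG and using the standard genetic code. The function name and table construction are illustrative, and the approach assumes the first ATG is the true start codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate_from_first_atg(listing_text):
    """Translate from the first ATG until a stop codon or the end of the text."""
    dna = "".join(listing_text.split()).upper()
    start = dna.find("ATG")
    if start < 0:
        return []
    residues = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(ONE_TO_THREE[aa])
    return residues


# First row (bases 1-60) of SEQ ID NO: 6 as listed.
SEQ_ID_6_START = "GAATTCCCCT CTCCACAGAC ACTGAAAACT CTGACTCAAC ATGGAAAGGC ACTGGATCTT"
print(" ".join(translate_from_first_atg(SEQ_ID_6_START)))
```

The output can be compared by eye with the opening residues of SEQ ID NO: 7; extending the input to the full 1570 bases would allow the whole entry to be checked the same way.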

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 107 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Gln Ile Val Leu Thr Gln Ser Pro Ala Ile Met Ser Ala Ser Pro Gly
1 5 10 15

Glu Lys Val Thr Met Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met
20 25 30

Asn Trp Tyr Gln Gln Lys Ser Gly Thr Ser Pro Lys Arg Trp Ile Tyr
35 40 45

Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ala His Phe Arg Gly Ser
50 55 60

Gly Ser Gly Thr Ser Tyr Ser Leu Thr Ile Ser Gly Met Glu Ala Glu
65 70 75 80

Asp Ala Ala Thr Tyr Tyr Cys Gln Gln Trp Ser Ser Asn Pro Phe Thr
85 90 95

Phe Gly Ser Gly Thr Lys Leu Glu Ile Asn Arg
100 105

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 108 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15

Asp Arg Val Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ile Lys Tyr
20 25 30

Leu Asn Trp Tyr Gln Gln Thr Pro Gly Lys Ala Pro Lys Leu Leu Ile
35 40 45

Tyr Glu Ala Ser Asn Leu Gln Ala Gly Val Pro Ser Arg Phe Ser Gly
50 55 60

Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro
65 70 75 80

Glu Asp Ile Ala Thr Tyr Tyr Cys Gln Gln Tyr Gln Ser Leu Pro Tyr
85 90 95

Thr Phe Gly Gln Gly Thr Lys Leu Gln Ile Thr Arg
100 105

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Gln Val Gln Leu Gln Gln Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala
1 5 10 15

Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Lys Gln Arg Pro Gly Gln Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Thr Asn Gln Lys Phe
50 55 60

Lys Asp Lys Ala Thr Leu Thr Thr Asp Lys Ser Ser Ser Thr Ala Tyr
65 70 75 80

Met Gln Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys
85 90 95

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly
100 105 110

Thr Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 126 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Ser Ser Ser Gly Phe Ile Phe Ser Ser Tyr
20 25 30

Ala Met Tyr Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45

Ala Ile Ile Trp Asp Asp Gly Ser Asp Gln His Tyr Ala Asp Ser Val
50 55 60

Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys
85 90 95

Ala Arg Asp Gly Gly His Gly Phe Cys Ser Ser Ala Ser Cys Phe Gly
100 105 110

Pro Asp Tyr Trp Gly Gln Gly Thr Pro Val Thr Val Ser Ser
115 120 125

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Gln Val Gln Leu Gln Gln Ser Gly Ala Glu Leu Ala Arg Pro Gly Ala
1 5 10 15

Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Lys Gln Arg Pro Gly Gln Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gln Lys Phe
50 55 60

Lys Asp Lys Ala Thr Leu Thr Thr Asp Lys Ser Ser Ser Thr Ala Tyr
65 70 75 80

Met Gln Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys
85 90 95

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly
100 105 110

Thr Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Ser Ser Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45

Ala Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gln Lys Phe
50 55 60

Lys Asp Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys
85 90 95

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly
100 105 110

Thr Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Gln Val Gln Leu Val Gln Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gln Lys Val
50 55 60

Lys Asp Arg Phe Thr Ile Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly
100 105 110

Thr Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Gln Val Gln Leu Val Gln Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gln Lys Val
50 55 60

Lys Asp Arg Phe Thr Ile Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys
85 90 95

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly
100 105 110

Thr Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Gln Val Gln Leu Val Gln Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gln Lys Val
50 55 60

Lys Asp Arg Phe Thr Ile Ser Thr Asp Lys Ser Lys Asn Thr Ala Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys
85 90 95

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly
100 105 110

Thr Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Gln Val Gln Leu Val Gln Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gln Lys Val
50 55 60

Lys Asp Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Ala Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys
85 90 95

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly
100 105 110

Thr Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Gln Val Gln Leu Val Gln Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gln Lys Val
50 55 60

Lys Asp Arg Phe Thr Ile Ser Thr Asp Lys Ser Lys Asn Thr Leu Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys
85 90 95

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly
100 105 110

Thr Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Gln Val Gln Leu Val Gln Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gln Lys Val
50 55 60

Lys Asp Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys
85 90 95

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly
100 105 110

Thr Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Gln Val Gln Leu Val Gln Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45

Ala Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Gln Lys Phe
50 55 60

Lys Asp Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys
85 90 95

Ala Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly
100 105 110

Thr Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 118 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Gln Val Gln Leu Val Gln Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Lys Val Lys
50 55 60

Asp Arg Phe Thr Ile Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe Leu
65 70 75 80

Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Ala Val Tyr Tyr Cys Ala
85 90 95

Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly Thr
100 105 110

Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 118 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Lys Val Lys
50 55 60

Asp Arg Phe Thr Ile Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe Leu
65 70 75 80

Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Ala Val Tyr Tyr Cys Ala
85 90 95

Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly Thr
100 105 110

Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 118 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Ser Ser Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Lys Val Lys
50 55 60

Asp Arg Phe Thr Ile Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe Leu
65 70 75 80

Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Ala Val Tyr Tyr Cys Ala
85 90 95

Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly Thr
100 105 110

Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 118 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Gln Val Gln Leu Val Gln Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Lys Val Lys
50 55 60

Asp Arg Phe Thr Ile Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe Leu
65 70 75 80

Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys Ala
85 90 95

Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly Thr
100 105 110

Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 118 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Lys Val Lys
50 55 60

Asp Arg Phe Thr Ile Ser Thr Asp Lys Ser Lys Ser Thr Ala Phe Leu
65 70 75 80

Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys Ala
85 90 95

Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly Thr
100 105 110

Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 118 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Gln Val Gln Leu Val Gln Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Ser Ala Ser Gly Tyr Thr Phe Thr Arg Tyr
20 25 30

Thr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Ile
35 40 45

Gly Tyr Ile Asn Pro Ser Arg Gly Tyr Thr Asn Tyr Asn Lys Val Lys
50 55 60

Asp Arg Phe Thr Ile Ser Thr Asp Lys Ser Lys Asn Thr Ala Phe Leu
65 70 75 80

Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys Ala
85 90 95

Arg Tyr Tyr Asp Asp His Tyr Cys Leu Asp Tyr Trp Gly Gln Gly Thr
100 105 110

Thr Leu Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 126 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Gln Val Gln Leu Val Glu Ser Gly Gly Gly Val Val Gln Pro Gly Arg
1 5 10 15

Ser Leu Arg Leu Ser Cys Ser Ser Ser Gly Phe Ile Phe Ser Ser Tyr
20 25 30

Ala Met Tyr Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45

Ala Ile Ile Trp Asp Asp Gly Ser Asp Gln His Tyr Ala Asp Ser Val
50 55 60

Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu Phe
65 70 75 80

Leu Gln Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys
85 90 95

Ala Arg Asp Gly Gly His Gly Phe Cys Ser Ser Ala Ser Cys Phe Gly
100 105 110

Pro Asp Tyr Trp Gly Gln Gly Thr Pro Val Thr Val Ser Ser
115 120 125

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 107 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Gln Ile Val Leu Thr Gln Ser Pro Ala Ile Met Ser Ala Ser Pro Gly
1 5 10 15

Glu Lys Val Thr Met Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met
20 25 30

Asn Trp Tyr Gln Gln Lys Ser Gly Thr Ser Pro Lys Arg Trp Ile Tyr
35 40 45

Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ala His Phe Arg Gly Ser
50 55 60

Gly Ser Gly Thr Ser Tyr Ser Leu Thr Ile Ser Gly Met Glu Ala Glu
65 70 75 80

Asp Ala Ala Thr Tyr Tyr Cys Gln Gln Trp Ser Ser Asn Pro Phe Thr
85 90 95

Phe Gly Ser Gly Thr Lys Leu Glu Ile Asn Arg
100 105

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 107 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15

Asp Arg Val Thr Ile Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met
20 25 30

Asn Trp Tyr Gln Gln Thr Pro Gly Lys Ala Pro Lys Leu Leu Ile Tyr
35 40 45

Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser
50 55 60

Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro Glu
65 70 75 80

Asp Ile Ala Thr Tyr Tyr Cys Gln Gln Trp Ser Ser Asn Pro Phe Thr
85 90 95

Phe Gly Gln Gly Thr Lys Leu Gln Ile Thr Arg
100 105

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 107 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

Gln Ile Val Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15

Asp Arg Val Thr Ile Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met
20 25 30

Asn Trp Tyr Gln Gln Thr Pro Gly Lys Ala Pro Lys Arg Trp Ile Tyr
35 40 45

Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser
50 55 60

Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro Glu
65 70 75 80

Asp Ile Ala Thr Tyr Tyr Cys Gln Gln Trp Ser Ser Asn Pro Phe Thr
85 90 95

Phe Gly Gln Gly Thr Lys Leu Gln Ile Thr Arg
100 105

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
20 (a) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



10 



15 



30 



35 



40 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 31: 



Gln Ile Val Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15

Asp Arg Val Thr Ile Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met
20 25 30

Asn Trp Tyr Gln Gln Thr Pro Gly Lys Ala Pro Lys Arg Trp Ile Tyr
35 40 45

Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser
50 55 60

Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro Glu
65 70 75 80

Asp Ile Ala Thr Tyr Tyr Cys Gln Gln Trp Ser Ser Asn Pro Phe Thr
85 90 95

Phe Gly Gln Gly Thr Lys Leu Gln Ile Thr Arg
100 105

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 107 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15

Asp Arg Val Thr Ile Thr Cys Ser Ala Ser Ser Ser Val Ser Tyr Met
20 25 30

Asn Trp Tyr Gln Gln Thr Pro Gly Lys Ala Pro Lys Arg Trp Ile Tyr
35 40 45

Asp Thr Ser Lys Leu Ala Ser Gly Val Pro Ser Arg Phe Ser Gly Ser
50 55 60

Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro Glu
65 70 75 80

Asp Ile Ala Thr Tyr Tyr Cys Gln Gln Trp Ser Ser Asn Pro Phe Thr
85 90 95

Phe Gly Gln Gly Thr Lys Leu Gln Ile Thr Arg
100 105

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
1 5 10 15

Asp Arg Val Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ile Lys Tyr
20 25 30

Leu Asn Trp Tyr Gln Gln Thr Pro Gly Lys Ala Pro Lys Leu Leu Ile
35 40 45

Tyr Glu Ala Ser Asn Leu Gln Ala Gly Val Pro Ser Arg Phe Ser Gly
50 55 60

Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro
65 70 75 80

Glu Asp Ile Ala Thr Tyr Tyr Cys Gln Gln Tyr Gln Ser Leu Pro Tyr
85 90 95

Thr Phe Gly Gln Gly Thr Lys Leu Gln Ile Thr Arg
100 105
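
As a cross-check on the listing, the three-letter entries can be collapsed to one-letter code and counted against the stated LENGTH. The following is a minimal sketch in Python, assuming the residues of SEQ ID NO: 33 as transcribed above; the helper name `to_one_letter` is illustrative and is not part of the specification.

```python
# Standard IUPAC three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(listing: str) -> str:
    """Collapse a whitespace-separated three-letter listing.

    Position numbers (tokens made only of digits) are skipped, so the
    numbering rows of a listing may be pasted in along with the residues.
    """
    return "".join(
        THREE_TO_ONE[token] for token in listing.split() if not token.isdigit()
    )


# Residues of SEQ ID NO: 33, copied from the listing (assumed transcription).
SEQ_ID_33 = """
Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly
Asp Arg Val Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ile Lys Tyr
Leu Asn Trp Tyr Gln Gln Thr Pro Gly Lys Ala Pro Lys Leu Leu Ile
Tyr Glu Ala Ser Asn Leu Gln Ala Gly Val Pro Ser Arg Phe Ser Gly
Ser Gly Ser Gly Thr Asp Tyr Thr Phe Thr Ile Ser Ser Leu Gln Pro
Glu Asp Ile Ala Thr Tyr Tyr Cys Gln Gln Tyr Gln Ser Leu Pro Tyr
Thr Phe Gly Gln Gly Thr Lys Leu Gln Ile Thr Arg
"""

seq33 = to_one_letter(SEQ_ID_33)
print(len(seq33))  # compare with the (A) LENGTH field of SEQ ID NO: 33
print(seq33)       # compare with the REI rows of the light chain figures
```

A token that is not a valid three-letter code raises `KeyError`, which makes character-recognition slips in a transcribed listing easy to locate.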



Claims 

1. A CDR-grafted antibody heavy chain having a variable region domain comprising acceptor framework
and donor antigen binding regions wherein the framework comprises donor residues at at least one of
positions 6, 23 and/or 24, 48 and/or 49, 71 and/or 73, 75 and/or 76 and/or 78 and 88 and/or 91.

2. A CDR-grafted heavy chain according to Claim 1 comprising donor residues at positions 23, 24, 49, 71,
73 and 78, or at positions 23, 24 and 48.

3. A CDR-grafted heavy chain according to Claim 2 comprising donor residues at positions 2, 4, 6, 25, 36,
37, 39, 47, 48, 93, 94, 103, 104, 106 and 107.

4. A CDR-grafted heavy chain according to Claim 2 or 3, comprising donor residues at one, some or all of
positions:

1 and 3,

69 (if 48 is different between donor and acceptor),

38 and 46 (if 48 is the donor residue),

67,

82 and 18 (if 67 is the donor residue),

91, and

any one or more of 9, 11, 41, 87, 108, 110 and 112.

5. A CDR-grafted heavy chain according to any of the preceding claims comprising donor CDRs at positions
26-35, 50-65 and 95-100.

6. A CDR-grafted antibody light chain having a variable region domain comprising acceptor framework 
and donor antigen binding regions wherein the framework comprises donor residues at at least one of 
positions 1 and/or 3 and 46 and/or 47. 

7. A CDR-grafted light chain according to Claim 6 comprising donor residues at positions 46 and 47.

8. A CDR-grafted antibody light chain having a variable region domain comprising acceptor framework
and donor antigen binding regions wherein the framework comprises donor residues at at least one of
positions 46, 48, 58 and 71.

9. A CDR-grafted light chain according to Claim 8 comprising donor residues at positions 46, 48, 58 and 
71. 

10. A CDR-grafted light chain according to Claim 8 or 9, comprising donor residues at positions 2, 4, 6, 35,
36, 38, 44, 47, 49, 62, 64-69, 85, 87, 98, 99, 101 and 102.

11. A CDR-grafted light chain according to Claim 9 or 10, comprising donor residues at one, some or all of
positions:

1 and 3,

63,

60 (if 60 and 54 are able to form a potential saltbridge),

70 (if 70 and 24 are able to form a potential saltbridge),

73 and 21 (if 47 is different between donor and acceptor),

37 and 45 (if 47 is different between donor and acceptor), and

any one or more of 10, 12, 40, 83, 103 and 105.

12. A CDR-grafted light chain according to any one of Claims 6-11, comprising donor CDRs at positions
24-34, 50-56 and 89-97.

13. A CDR-grafted antibody molecule comprising at least one CDR-grafted heavy chain according to any
one of Claims 1-5 and at least one CDR-grafted light chain according to any one of Claims 6-12.

14. A CDR-grafted antibody molecule according to Claim 13, which is a site-specific antibody molecule.

15. A CDR-grafted antibody molecule according to Claim 13 which has specificity for an interleukin,
hormone or other biologically active compound or a receptor therefor.

16. A CDR-grafted antibody heavy or light chain or molecule according to any one of the preceding claims
comprising human acceptor residues and non-human donor residues.

17. A DNA sequence which codes for a CDR-grafted heavy chain according to Claim 1 or a CDR-grafted
light chain according to Claim 6 or Claim 8.

18. A cloning or expression vector containing a DNA sequence according to Claim 17.

19. A host cell transformed with a DNA sequence according to Claim 17.

20. A process for the production of a CDR-grafted antibody sequence according to Claim 17 in a
transformed host cell.

21. A process for producing a CDR-grafted antibody product comprising:

(a) producing in an expression vector an operon having a DNA sequence which encodes an antibody
heavy chain according to Claim 1;
and/or

(b) producing in an expression vector an operon having a DNA sequence which encodes a
complementary antibody light chain according to Claim 6 or Claim 8;

(c) transfecting a host cell with the or each vector;
and

(d) culturing the transfected cell line to produce the CDR-grafted antibody product.

22. A therapeutic or diagnostic composition comprising a CDR-grafted antibody heavy chain according to
Claim 1, or a CDR-grafted light chain according to Claim 6 or Claim 8, or a CDR-grafted antibody
molecule according to Claim 13 in combination with a pharmaceutically acceptable carrier, diluent or
excipient.

23. A method of therapy or diagnosis comprising administering an effective amount of a CDR-grafted heavy
chain according to Claim 1, or a CDR-grafted light chain according to Claim 6 or Claim 8, or a
CDR-grafted antibody molecule according to Claim 13 to a human or animal subject.

1 GAATTCCCAA AGACAAAatg gattttcaag tgcagatttt cagcttcctg

51 ctaatcagtg cctcagtcat aatatccaga ggacaaattg ttctcaccca

101 gtctccagca atcatgtctg catctccagg ggagaaggtc accatgacct 

151 gcagtgccag ctcaagtgta agttacatga actggtacca gcagaagtca 

201 ggcacctccc ccaaaagatg gatttatgac acatccaaac tggcttctgg

251 agtccctgct cacttcaggg gcagtgggtc tgggacctct tactctctca

301 caatcagcgg catggaggct gaagatgctg ccacttatta ctgccagcag 

351 tggagtagta acccattcac gttcggctcg gggacaaagt tggaaataaa 

401 ccgggctgat actgcaccaa ctgtatccat cttcccacca tccagtgagc 

451 agttaacatc tggaggtgcc tcagtcgtgt gcttcttgaa caacttctac

501 cccaaagaca tcaatgtcaa gtggaagatt gatggcagtg aacgacaaaa 

551 tggcgtcctg aacagttgga ctgatcagga cagcaaagac agcacctaca 

601 gcatgagcag caccctcacg ttgaccaagg acgagtatga acgacataac 

651 agctatacct gtgaggccac tcacaagaca tcaacttcac ccattgtcaa

701 gagcttcaac aggaatgagt gtTAGAGACA AAGGTCCTGA GACGCCACCA

751 CCAGCTCCCA GCTCCATCCT ATCTTCCCTT CTAAGGTCTT GGAGGCTTCC

801 CCACAAGCGC tTACCACTGT TGCGGTGCTC tAAACCTCCT CCCACCTCCT 

851 TCTCCTCCTC CTCCCTTTCC TTGGCTTTTA TCATGCTAAT ATTTGCAGAA 

901 AATATTCAAT AAAGTGAGTC TTTGCCTTGA AAAAAAAAAA AAA

Fig. 1(a)

1 MDFQVQIFSF LLISASVIIS RGQIVLTQSP AIMSASPGEK VTMTCSASSS

51 VSYMNWYQQK SGTSPKRWIY DTSKLASGVP AHFRGSGSGT SYSLTISGME

101 AEDAATYYCQ QWSSNPFTFG SGTKLEINRA DTAPTVSIFP PSSEQLTSGG

151 ASVVCFLNNF YPKDINVKWK IDGSERQNGV LNSWTDQDSK DSTYSMSSTL

201 TLTKDEYERH NSYTCEATHK TSTSPIVKSF NRNEC*

Fig. 1(b)
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1 GAATTCCCCT CTCCACAGAC ACTGAAAACT CTGACTCAAC ATGGAAAGGC 
51 ACTGGATCTT TCTACTCCTG TTGTCAGTAA CTGCAGGTGT CCACTCCCAG 
101 GTCCAGCTGC AGCAGTCTGG GGCTGAACTG GCAAGACCTG GGGCCTCAGT 
151 GAAGATGTCC TGCAAGGCTT CTGGCTACAC CTTTACTAGG TACACGATGC 
201 ACTGGGTAAA ACAGAGGCCT GGACAGGGTC TGGAATGGAT TGGATACATT 
251 AATCCTAGCC GTGGTTATAC TAATTACAAT CAGAAGTTCA AGGACAAGGC 
301 CACATTGACT ACAGACAAAT CCTCCAGCAC AGCCTACATG CAACTGAGCA 
351 GCCTGACATC TGAGGACTCT GCAGTCTATT ACTGTGCAAG ATATTATGAT
401 GATCATTACT GCCTTGACTA CTGGGGCCAA GGCACCACTC TCACAGTCTC 
451 CTCAGCCAAA ACAACAGCCC CATCGGTCTA TCCACTGGCC CCTGTGTGTG 
501 GAGATACAAC TGGCTCCTCG GTGACTCTAG GATGCCTGGT CAAGGGTTAT 
551 TTCCCTGAGC CAGTGACCTT GACCTGGAAC TCTGGATCCC TGTCCAGTGG 
601 TGTGCACACC TTCCCAGCTG TCCTGCAGTC TGACCTCTAC ACCCTCAGCA 
651 GCTCAGTGAC TGTAACCTCG AGCACCTGGC CCAGCCAGTC CATCACCTGC 
701 AATGTGGCCC ACCCGGCAAG CAGCACCAAG GTGGACAAGA AAATTGAGCC 
751 CAGAGGGCCC ACAATCAAGC CCTGTCCTCC ATGCAAATGC CCAGCACCTA 
801 ACCTCTTGGG TGGACCATCC GTCTTCATCT TCCCTCCAAA GATCAAGGAT 
851 GTACTCATGA TCTCCCTGAG CCCCATAGTC ACATGTGTGG TGGTGGATGT 
901 GAGCGAGGAT GACCCAGATG TCCAGATCAG CTGGTTTGTG AACAACGTGG 
951 AAGTACACAC AGCTCAGACA CAAACCCATA GAGAGGATTA CAACAGTACT 
1001 CTCCGGGTGG TCAGTGCCCT CCCCATCCAG CACCAGGACT GGATGAGTGG 
1051 CAAGGAGTTC AAATGCAAGG TCAACAACAA AGACCTCCCA GCGCCCATCG 
1101 AGAGAACCAT CTCAAAACCC AAAGGGTCAG TAAGAGCTCC ACAGGTATAT 
1151 GTCTTGCCTC CACCAGAAGA AGAGATGACT AAGAAACAGG TCACTCTGAC 
1201 CTGCATGGTC ACAGACTTCA TGCCTGAAGA CATTTACGTG GAGTGGACCA 
1251 ACAACGGGAA AACAGAGCTA AACTACAAGA ACACTGAACC AGTCCTGGAC 
1301 TCTGATGGTT CTTACTTCAT GTACAGCAAG CTGAGAGTGG AAAAGAAGAA 
1351 CTGGGTGGAA AGAAATAGCT ACTCCTGTTC AGTGGTCCAC GAGGGTCTGC 
1401 ACAATCACCA CACGACTAAG AGCTTCTCCC GGACTCCGGG TAAATGAGCT 
1451 CAGCACCCAC AAAACTCTCA GGTCCAAAGA GACACCCACA CTCATCTCCA 
1501 TGCTTCCCTT GTATAAATAA AGCACCCAGC AATGCCTGGG ACCATGTAAA 
1551 AAAAAAAAAA AAAGGAATTC 



Fig. 2(a)

OKT3 HEAVY CHAIN PROTEIN SEQUENCE DEDUCED FROM DNA SEQUENCE

1 MERHWIFLLL LSVTAGVHSQ VQLQQSGAEL ARPGASVKMS CKASGYTFTR

51 YTMHWVKQRP GQGLEWIGYI NPSRGYTNYN QKFKDKATLT TDKSSSTAYM

101 QLSSLTSEDS AVYYCARYYD DHYCLDYWGQ GTTLTVSSAK TTAPSVYPLA

151 PVCGDTTGSS VTLGCLVKGY FPEPVTLTWN SGSLSSGVHT FPAVLQSDLY

201 TLSSSVTVTS STWPSQSITC NVAHPASSTK VDKKIEPRGP TIKPCPPCKC

251 PAPNLLGGPS VFIFPPKIKD VLMISLSPIV TCVVVDVSED DPDVQISWFV

301 NNVEVHTAQT QTHREDYNST LRVVSALPIQ HQDWMSGKEF KCKVNNKDLP

351 APIERTISKP KGSVRAPQVY VLPPPEEEMT KKQVTLTCMV TDFMPEDIYV

401 EWTNNGKTEL NYKNTEPVLD SDGSYFMYSK LRVEKKNWVE RNSYSCSVVH

451 EGLHNHHTTK SFSRTPGK*

Fig. 2(b) 
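
The deduction of a protein sequence from a DNA sequence, as in Fig. 2(b), can be sketched with the standard genetic code. The Python below is an illustrative sketch only: the function name `translate` is hypothetical, and the test fragment is taken as the first 75 nucleotides of the open reading frame in Fig. 2(a), assuming the ATG at nucleotide 41 is the initiation codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string read in frame from its first base.

    Whitespace is ignored and case is folded, so sequences may be pasted
    in the ten-base blocks used in the figures. Translation stops after
    the first stop codon, which is reported as "*".
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        protein.append(residue)
        if residue == "*":
            break
    return "".join(protein)


# Nucleotides 41-115 of Fig. 2(a), starting at the assumed initiation codon.
FIG_2A_FRAGMENT = (
    "ATGGAAAGGC ACTGGATCTT TCTACTCCTG TTGTCAGTAA "
    "CTGCAGGTGT CCACTCCCAG GTCCAGCTGC AGCAG"
)

print(translate(FIG_2A_FRAGMENT))  # compare with the start of Fig. 2(b)
```

Translating the full coding region in the same way gives a residue-by-residue check of a transcribed figure, since a single misread base usually changes the corresponding residue.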

1 23 42 

NN N N N N 

RES TYPE SBspSPESssBSbSsSssPSPSPsPSsse*s*p*Pi~ISsSe 
Okt3vl QIVLTQSPAIMSASPGEKVTMTCSASS.SVSYMNWYQQKSGT

REI DIQMTQSPSSLSASVGDRVTITCQASQDIIKYLNWYQQTPGK



CDR1 (LOOP) ******* 
CDR1 (KABAT) *********** 



56 85 

N NN 

RES TYPE *lsiPpIeesesssSBEsePsPSBSSEsPspsPsseesSPePb 
Okt3vl SPKRWIYDTSKLASGVPAHFRGSGSGTSYSLTISGMEAEDAAT
REI APKLLIYEASNLQAGVPSRFSGSGSGTDYTFTISSLQPEDIAT 
? ?? ? ? 

******* CDR2 (LOOP /KABAT) 



102 108

RES TYPE PiPIPies**iPIIsPPS?SPSS
Okt3vl YYCQQWSSNPFTFGSGTKLEINR
REIvl YYCQQYQSLPYTFGQGTKLQITR

****** CDR3 (LOOP)
********* CDR3 (KABAT)

Fig. 3
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NN N 23 26 32 35 N39 43 

RES TYPE SESFs~SBssS~sSS$SpSpSPsPSEbSBssBePiPIpiesss
Okt3vh QVQLQQSGAELARPGASVKMSCKASGYTFTRYTMHWVKQRPGQ
KOL QVQLVESGGGVVQPGRSLRLSCSSSGFIFSSYAMYWVRQAPGK

****** CDR1 (LOOP)
***** CDR1 (KABAT)

52a 60 65 N N N 82abc 89

RES TYPE I Ielppp * ssssssss " ps * pSSsbSpseSsSseSp "pSpsSBssS " ePb
Okt3vh GLEWIGYINPSRGYTNYNQKFKDKATLTTDKSSSTAYMQLSSLTSEDSAV
KOL GLEWVAIIWDDGSDQHYADSVKGRFTISRDNSKNTLFLQMDSLRPEDTGV

?? ? ? ? ? ?

************ CDR2 (LOOP)

******************* CDR2 (KABAT)

92 N 107 113

RES TYPE PiPIEissssiiisssbibi*EIPIP*spSBSS

Okt3vh YYCARYYDDHY . CLDYWGQGTTLTVSS

KOL YFCARDGGHGFCSSASCFGPDYWGQGTPVTVSS

***************** CDR3 (KABAT/LOOP)

Fig. 4
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OKT 3 HEAVY CHAIN CDR GRAFTS 

1. gh341 and derivatives 

x 26 35 39 43 

Okt3vh QVQLQQSGAELARPGASVKMSCKASGYTFTRYTMHWVKQRPGQ 
gH341 QVQLVESGGGWQPGRSUa^CSS?;GYTmYTMHWVRQAPGK JA178 
gH341A QVQLVfiSGGGWQPGRSlJRI.SCTASGYTfTPYTWHWVRQAPGK JA185 



QVQLVfinCCfnH^PTP eT PT trv^gr.VTFTRYTMHWVRQAPGK 
QVQLVfiSGGGWQPGRSIJU^qCASGXEElSmKWVRQAPGK 
QVQLVgSGGGWQPGRSIJ^SCEASGYTnyYTMHWVRQAPGK 

gH341D QVQLVgS(^WQPGRSIJtI^CHSGYTmYTMHW^QAPGK 
gH341* QVQLVgSGGGWQPGRSUU,SCKASGYTFTRYTWHWVRQAPGK JA199 
gH341C QVQLVgSGGGWQPGRSUU^qaSGYTmYTWHWVRQAPGK 



gH341E 
gH341* 
gH341* 



JAX98 
JA207 
JA209 
JA197 



JA184 



gH341* QVQLVgSGGGWQPGRSLPXSCSASGYTrTRY'nWWVRQAPGK JA203 

gH341* QVQLVESGGGWQPGRSIJU.SCSASGYTFTRYTMHWVRQAPGK JA205 

gH3 4 IB QVQLVESGGGWQPGRSUU.SCSS.SGYTrrPYTHHWVRQAPGK 

gH3 4 1* QVQLVgSGGGWQPGRSIJU,SCSASGYTmYTyHWVRQAPGK 

gH341* QVQLVESGGGWQPGRSU^SCSASGYTFTRYTlgtWVRQAPGK 

gH34X* QVQLVgSGCGWQPGRSIJU^CSASGYTfTRYTMHWVRQAPCK 

KOL QVQLVESGGGVVQPGRSLRLSCSSSGFIFSSYAMYWVRQAPGK

Fig. 5(i) 



JA1B3 
JA204 
JA206 
JA208 
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44 50 65 83 

Okt3vh GLEWIGYINPSRGYTNYNQKFKDKATLTTDKSSSTAYMQLSSLT

gH341 CLEMV AVTNPSRGYTNYNOKFKDR FTISRDN^ JA178 

gH3 4 1A G1^EW^CY ^'^SRGYTtrraOK >nCDRFTISJDgSK^firiiQMDSLR JA165 

gH341E GLE WJGYINPSRGYTNYNOKV KDRFTISyDKSKgTftFLQMDSLR JA198 

gH341* GLEWIGYINPSRGYTNYNQKVKPRFTI STDKSKNTftFLQMDSLR JA207 

gH341* GLE W^YTKPSRGYTWYNOKV KDRFTISRDNSKHTftFLQMDSIJR JA209 

gH341D GLEWJGY I NPSRGYTKYNQKVXPRFTI SXDKSKWTLFLQMDSLR JA197 

gH341* GLE WJiGY I NPSRGYTNYNOKV KPRFTI SRDN SKNTLFLQMPSLR JA199 

gH341C GLEWV AYI NPSRGYTNYNOKFKDR FTI SRDNSKWTLFLQMDSLR JA184 



gHg4I* GLEW ^GYIMPSRGYTNYKQyV KDRFTI STDKSKgTAFLQKDSLR JA207 

J gH341* GLEW Y XNPSRG YTN YNOKV KDRFTI STDff SKSTft FLQMPSLR JA205 

gH341B GLE WTjgYTNPSRGYTNYNOKV KDRFTIS^DySKgTftFLQMDSLR JA183 

gH341* GI.E WTGYTWPSRGYTWYNOK VKPRFTISyD|CSKgTfiFLOMDSLR JA204 

gH341* GLEW ^IGYI NPSRGYTNYNOKV KPRFTI STDySKSTftFLQMDSLR JA206 

gH341* GLEW jGY I KPSRGYTWYHOK VKDRFTI STPff SKNTftFLQMDSLR JA208 

KOL GLEWVAIIWDDGSDQHYADSVKGRFTISRDNSKNTLFLQMDSLR



Fig.5(ii) 
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84 95 


102 113 




Okt3vh  SEDSAVYYCARYYDDHY..  CLDYWGQGTTLTVSS
gH341   PEDTGVYFCARYYDDHY..  CLDYWGQGTTLTVSS  JA178
gH341A  PEDTGVYFCARYYDDHY..  CLDYWGQGTTLTVSS  JA185
gH341E  PEDTGVYFCARYYDDHY..  CLDYWGQGTTLTVSS  JA198
gH341*  PEDTGVYFCARYYDDHY..  CLDYWGQGTTLTVSS  JA207
gH341D  PEDTGVYFCARYYDDHY..  CLDYWGQGTTLTVSS  JA197
gH341*  PEDTGVYFCARYYDDHY..  CLDYWGQGTTLTVSS  JA209
gH341*  PEDTGVYFCARYYDDHY..  CLDYWGQGTTLTVSS  JA199
gH341C  PEDTGVYFCARYYDDHY..                   JA184
gH341*  PEDTAVYYCARYYDDHY..  CLDYWGQGTTLTVSS  JA203
gH341*  PEDTAVYYCARYYDDHY..                   JA205
gH341B  PEDTAVYYCARYYDDHY..                   JA183
gH341*  PEDTGVYFCARYYDDHY..  CLDYWGQGTTLTVSS  JA204
gH341*  PEDTGVYFCARYYDDHY..                   JA206
gH341*  PEDTGVYFCARYYDDHY..  CLDYWGQGTTLTVSS  JA208
KOL     PEDTGVYFCARDGGHGFCSSASCFGPDYWGQGTPVTVSS

Fig. 5(iii)
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OKT3 LIGHT CHAIN CDR GRAFTING 
1. gL221 and derivatives

1 24 34 42

Okt3vl  QIVLTQSPAIMSASPGEKVTMTCSASS.SVSYMNWYQQKSGT
gL221   DIQMTQSPSSLSASVGDRVTITCSASS.SVSYMNWYQQTPGK
gL221A  QIVMTQSPSSLSASVGDRVTITCSASS.SVSYMNWYQQTPGK
gL221B  QIVMTQSPSSLSASVGDRVTITCSASS.SVSYMNWYQQTPGK
gL221C  DIQMTQSPSSLSASVGDRVTITCSASS.SVSYMNWYQQTPGK
REI     DIQMTQSPSSLSASVGDRVTITCQASQDIIKYLNWYQQTPGK

43 50 56 85

Okt3vl  SPKRWIYDTSKLASGVPAHFRGSGSGTSYSLTISGMEAEDAAT
gL221   APKLLIYDTSKLASGVPSRFSGSGSGTDYTFTISSLQPEDIAT
gL221A  APKRWIYDTSKLASGVPSRFSGSGSGTDYTFTISSLQPEDIAT
gL221B  APKRWIYDTSKLASGVPSRFSGSGSGTDYTFTISSLQPEDIAT
gL221C  APKRWIYDTSKLASGVPSRFSGSGSGTDYTFTISSLQPEDIAT
REI     APKLLIYEASNLQAGVPSRFSGSGSGTDYTFTISSLQPEDIAT

86 91 96 108

Okt3vl  YYCQQWSSNPFTFGSGTKLEINR
gL221   YYCQQWSSNPFTFGQGTKLQITR
gL221A  YYCQQWSSNPFTFGQGTKLQITR
gL221B  YYCQQWSSNPFTFGQGTKLQITR
gL221C  YYCQQWSSNPFTFGQGTKLQITR
REI     YYCQQYQSLPYTFGQGTKLQITR

CDR'S ARE UNDERLINED

FRAMEWORK RESIDUES INCLUDED IN THE GENE ARE DOUBLE UNDERLINED

Fig. 6
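
The construction shown in Fig. 6 can be sketched as a position-wise choice between donor and acceptor residues. The Python below is a minimal illustration, not the authors' procedure: the function name `graft` is hypothetical, the Okt3vl and REI strings are the aligned rows of Fig. 6 with "." as the alignment gap, and it assumes that alignment column n corresponds to position n, which holds for this light chain alignment because it contains no lettered insertion positions. The CDR ranges are those of Claim 12 and the framework positions those of Claim 7.

```python
# Aligned variable region sequences from Fig. 6 ("." is the alignment gap).
OKT3_VL = (
    "QIVLTQSPAIMSASPGEKVTMTCSASS.SVSYMNWYQQKSGT"    # positions 1-42
    "SPKRWIYDTSKLASGVPAHFRGSGSGTSYSLTISGMEAEDAAT"   # positions 43-85
    "YYCQQWSSNPFTFGSGTKLEINR"                       # positions 86-108
)
REI_VL = (
    "DIQMTQSPSSLSASVGDRVTITCQASQDIIKYLNWYQQTPGK"
    "APKLLIYEASNLQAGVPSRFSGSGSGTDYTFTISSLQPEDIAT"
    "YYCQQYQSLPYTFGQGTKLQITR"
)

# Light chain CDR ranges as recited in Claim 12 (inclusive positions).
LIGHT_CHAIN_CDRS = [(24, 34), (50, 56), (89, 97)]


def graft(donor, acceptor, cdrs, framework_positions=()):
    """Return acceptor framework carrying donor CDRs.

    Any position listed in framework_positions is also taken from the
    donor. Both inputs must be aligned to the same length; gap characters
    are removed from the result.
    """
    if len(donor) != len(acceptor):
        raise ValueError("donor and acceptor must be aligned to equal length")
    from_donor = set(framework_positions)
    for start, end in cdrs:
        from_donor.update(range(start, end + 1))
    residues = [
        d if position in from_donor else a
        for position, (d, a) in enumerate(zip(donor, acceptor), start=1)
    ]
    return "".join(residues).replace(".", "")


# CDRs only, and CDRs plus donor framework residues at positions 46 and 47.
cdr_only = graft(OKT3_VL, REI_VL, LIGHT_CHAIN_CDRS)
with_46_47 = graft(OKT3_VL, REI_VL, LIGHT_CHAIN_CDRS, framework_positions=(46, 47))

print(cdr_only)
print(with_46_47)
```

The same function would not be correct for the heavy chain alignments without a separate mapping from column to position, because those figures include lettered positions such as 52a and 82abc.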
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Fig. 9 
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Fig. 10 
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Fig. 11 
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